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INTRODUCTION 


Proliferation  of  mammary  and  uterine  tissues  is  governed  in  large  part  by  the  steroid  hormones  estrogen 
and  progesterone  (1).  These  hormones  trigger  activation  of  genes  that  regulate  the  cell  cycle  (3,  5). 
Hormone  binding  to  specific  receptor  proteins  causes  translocation  of  the  receptor  to  the  nucleus.  The 
receptor  then  binds  to  response  elements  upstream  of  target  genes  to  activate  their  expression. 

Activation  by  the  steroid  hormone  receptors  requires  several  cofactors,  including  the  CREB  binding 
protein  (CBP)  or  the  highly  related  p300  protein,  as  well  as  SRC-1,  PCAF,  and  ACTR  (2, 3, 4,  9). 
Strikingly,  all  of  these  cofactors  possess  histone  acetyltransferase  activity.  The  histone  proteins  fold 
DNA  into  a  highly  compacted  structure  called  chromatin.  This  folding  is  generally  inhibitory  to 
transcription.  Post-translational  modification  of  the  histones,  including  acetylation,  modulates  chromatin 
folding  and  is  therefore  key  to  the  regulation  of  gene  expression  (7,  8).  Histone  acetylation  levels  are 
governed  by  the  opposing  actions  of  two  groups  of  en2ymes,  the  histone  acetyltransferases  and  the 
histone  deacetylases.  Experiments  supported  by  this  IDEA  grant  are  focused  on  understanding  the  role  of 
the  PCAF  histone  acetyltransferase  in  steroid  hormone  responses  in  normal  mammary  development  and 
in  abnormal  events  that  lead  to  cancer.  Our  original  Specific  Aims  were  to  1)  complete  the  isolation  and 
cloning  of  mouse  PCAF  cDNA  and  genomic  sequences  2)  to  characterize  murine  PCAF  as  a  histone 
acetyltransferase  and  3)  examine  PCAF  interactions  with  specific  steroid  receptors  and  CBP/p300  4)  to 
examine  whether  acetylase  activity  is  required  for  transcriptional  activation  by  the  hormone  receptors  5) 
to  create  mice  deficient  for  PCAF  and  mice  that  overexpress  this  factor  to  assess  the  role  of  PCAF  in 
mouse  development,  differentiation,  and  tumorigenesis.  Our  progress  towards  these  goals  is  described 
below. 


BODY 


Specific  Aim  1:  to  complete  the  isolation  and  cloning  of  mouse  PCAF  cDNA  and  genomic 
sequences.  This  specific  aim  is  completed.  Cloning  of  the  mouse  PCAF  cDNA  and  genomic  sequences 
was  reported  in  our  paper  published  in  Molecular  and  Cellular  Biology  in  October  of  last  year  (Xu  et  al, 
1998;  attached  in  appendix;  see  Figs.  1,  2,  and  3).  This  paper  also  reports  the  cloning  of  a  highly  related 
gene  encoding  the  GCN5  histone  acetyltransferase.  Interestingly,  GCN5  is  genetically  linked  to  the 
BRCAl  locus. 

Specific  Aim  2:  to  characterize  murine  PCAF  as  a  histone  acetyltransferase.  This  specific  aim  is 
also  completed,  and  the  results  are  reported  in  Figs.  6  and  7  of  the  Xu  et  al,  1998  manuscript  referenced 
above  (attached  in  appendix).  We  also  characterized  the  acetyltransferase  activity  of  GCN5  in  this 
paper.  We  found  that  both  enzymes  specifically  acetylate  lysine  14  in  histone  H3.  They  will  also 
acetylate  histone  H4,  but  at  a  much  decreased  level.  Importantly,  both  enzymes  recognize  H3  in  the 
context  of  a  nucleosome,  the  physiological  substrate. 

Specific  Aim  3:  to  examine  PCAF  interactions  with  specific  steroid  receptors  and  CBP/p300.  We 

and  others  have  described  the  binding  of  PCAF  to  CBP/p300  (  Xu  et  al,  1998  and  (9)).  Tissue  culture 
experiments  have  been  performed  by  others  to  examine  the  importance  of  PCAF  to  hormone  responses 
(2)  and  they  have  found  that  PCAF  augments  estrogen  responses.  Our  basic  premise,  then,  that  PCAF  is 
important  for  these  processes  is  supported  by  these  cell  culture  results.  However,  our  results  below  with 
the  PCAF  knock  out  mice  indicate  that  PCAF  is  not  required  for  these  processes  in  whole  animals. 
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Therefore,  we  have  completed  all  the  relevant  experiments  in  this  aim  and  PCAF  interactions  with  the 
hormone  receptors  will  not  be  pursued  further. 

Specific  Aim  4:  to  examine  whether  PCAF  acetylase  activity  is  required  for  transcriptional 
activation  by  the  hormone  receptors.  Again  these  experiments  are  superceded  by  our  findings  with  the 
PCAF  knockout  mice  below,  which  indicate  PCAF  is  not  needed  for  hormone  responses  in  mice.  We 
now  propose  to  modify  this  aim  to  make  acetylase  minus  forms  of  GCN5  for  gene  replacement  studies  in 
mice.  Such  studies  will  be  less  artificial  and  more  informative  than  cell  culture  experiments. 

Specific  Aim  5:  to  create  mice  deficient  for  PCAF  and  mice  that  overexpress  this  factor  to  assess 
the  role  of  PCAF  in  mouse  development,  differentiation,  and  tumorigenesis.  The  first  half  of  this 
aim,  to  create  PCAF  deficient  mice  is  completed.  Since  PCAF  and  GCN5  are  very  highly  related,  we 
also  created  mice  deficient  for  GCN5.  Our  creation  and  analysis  of  these  mice  is  reported  in  a 
manuscript  recently  submitted  for  publication  (Xu  et  al,  1999;  attached  in  appendix).  We  found  that  loss 
of  PCAF  has  no  adverse  effects  on  mouse  development  (see  Fig.  1  in  Xu  et  al,  1999).  Neither  did  we 
find  any  abnormal  phenotype  associated  with  PCAF  loss  in  adult  mice.  Estrogen-regulated  processes 
such  as  pregnancy  and  lactation  are  also  unaffected.  In  contrast,  loss  of  the  highly  related  GCN5  protein 
causes  embryonic  lethality  (see  Fig.  2,  Xu  et  al,  1999).  We  determined  that  this  lethality  is  due  to  an 
increase  in  programmed  cell  death  of  particular  mesodermal  lineages  (see  Fig.  6,  Xu  et  al,  1999).  These 
exciting  results  suggest  that  GCN5  may  serve  to  regulate  apoptosis,  and  thus  is  likely  be  important  in 
regulation  of  tumor  formation  or  growth. 

Accomplishments  towards  original  Statement  of  Work: 

Year  1 :  All  goals  for  year  1  are  complete  except  for  the  last  one,  to  clone  an  MMTV-PCAF  transgene. 
This  goal  is  no  longer  relevant  since  our  PCAF  deficient  mice  exhibit  no  phenotype.  We  propose  instead 
to  redirect  our  efforts  to  create  mice  that  express  PCAF  in  place  of  GCN5,  to  determine  whether  it  can 
rescue  the  lethality  of  the  GCN5  deficient  mice.  If  so,  then  we  will  conclude  that  PCAF  is  redundant  in 
function  to  GCN5  (and  that  the  difference  in  the  mouse  phenotypes  simply  reflects  a  difference  in 
expression  patterns)  and  that  PCAF  too  is  likely  to  regulate  apoptosis.  Therefore,  both  proteins  may  be 
very  important  in  tumor  formation.  We  also  propose  to  create  a  breast  specific  knock  out  of  the  GCN5 
gene  to  see  if  it  has  an  effect  on  mammary  development  or  tumor  formation.  Given  the  data  we  now  have 
in  hand,  these  experiments  will  much  more  informative  and  have  more  potential  for  importance  to  breast 
cancer  than  did  the  original  goal. 

Year  2:  All  year  2  goals  are  also  complete  except  for  the  last  one,  which  again  involves  the  MMTV- 
PCAF  transgene  that  is  no  longer  relevant.. 

Year  3:  All  year  3  goals  are  in  progress,  again  with  the  exception  of  the  MMTV  transgene  experiments. 
We  are  now  cloning  ‘knock  in  vectors’  for  PCAF  and  GCN5  that  will  contain  either  wild  type  or 
acetylase  minus  versions  of  these  genes  (goal  1).  We  are  monitoring  the  PCAF  knock  out  mice  for 
tumors  (goal  2).  Goal  3,  to  examine  mammary  tissue  in  the  PCAF  knock  out  mice,  was  actually 
completed  last  year.  We  are  beginning  the  design  and  cloning  of  the  breast-specific  knock  out  vector  for 
GCN5  (revised  goal  4).  When  these  mice  are  created,  we  will  examine  the  expression  and  functions  of 
the  steroid  receptors  (goalS). 


KEY  RESEARCH  ACCOMPLISHMENTS 


Reported  in  Xu  et  al,  1998: 

•  Isolation  and  cloning  of  the  murine  PCAF  and  GCN5  genes. 
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•  Determination  that  both  PCAF  and  GCN5  interact  with  CBP/p300 

•  Determination  that  GCN5  exists  in  a  long  and  short  form  in  mammalian  cells 

•  Determination  that  GCN5  and  PCAF  have  identical  substrate  specificities 

•  Determination  that  GCN5  and  PCAF  are  expressed  in  complementary  patterns  in  adult  mice 

•  Determination  that  GCN5  is  genetically  linked  to  the  BRCAl  locus 

Reported  in  Xu  et  al,  1999 

•  Determination  that  PCAF  is  dispensable  in  mice 

•  Determination  that  GCN5  is  essential  for  mouse  development 

•  Detemination  that  specific  mesodermal  lineages  are  missing  in  the  GCN5  mutant  mice 

•  Determination  that  loss  of  mesodermal  lineages  in  the  absence  of  GCN5  is  due  to  a  25  fold  increase 
in  apoptosis  in  these  tissues. 


REPORTABLE  OUTCOMES 


Manuscripts: 

1) .  Xu,  W.,  Edmondson,  D.  G.,  and  Roth,  S.Y.  (1998).  Mammalian  GCN5  and  PCAF  acetyltransferases 
have  homologous  aminoterminal  domains  important  for  recognition  of  nucleosomal  substrates. 
Molecular  and  Cellular  Biology  18,  5659-5669 

2)  Xu,  W.,  Edmondson,  D.G.,  Evrard,  Y.A.,  Wakamiya,  M.,  Behringer,  R.R.,  and  Roth,  S.Y.  (1999) 

Loss  of  GCN5  histone  acetyltransferase  leads  to  increased  apoptosis  and  lineage  specific  mesodermal 
defects  during  mouse  development.,  submitted  for  publication. 

Degrees  obtained: 

The  Ph.D.  thesis  research  of  my  student.  Wanting  Xu,  was  supported  largely  by  this  grant.  Dr.  Xu 
completed  her  Ph.D.  in  May,  1999. 

Funding  Applied  for: 

The  current  grant  expires  in  June,  2000.  To  continue  and  extend  this  research,  I  have  applied  for  another 
IDEA  award  entitled  “Role  of  GCN5  in  estrogen  response,  tumor  suppression,  and  breast  development  in 
mice’. 


CONCLUSIONS 

We  discovered  that  the  PCAF  gene  is  very  highly  related  in  structure  and  in  function  to  the  GCN5  gene. 
However,  loss  of  these  two  genes  has  very  different  consequences  in  the  mouse.  PCAF  is  apparently 
dispensable.  This  result  is  especially  surprising  because  a  number  of  studies  in  tissue  culture  cells 
indicate  that  PCAF  is  important  to  hormone  responses,  including  estrogen  responses,  and  to  muscle  cell 
differentiation.  The  absence  of  abnormalities  in  hormone  dependent  processes  in  PCAF  deficient  mice 
indicates  either  that  PCAF  is  not  important  for  these  processes  or  that  redundant  activities,  like  GCN5, 
‘fill  in’  when  PCAF  is  missing.  In  contrast,  loss  of  GCN5  results  in  embryonic  death  due  to  an  increase 
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in  apoptosis.  This  finding  further  supports  the  idea  that  GCN5  and  PCAF  may  be  important  for 
regulation  of  tumorigenesis,  since  programmed  cell  death  is  often  altered  in  tumor  cells.  We  have  also 
found  that  GCN5  is  genetically  linked  to  BRCAl  in  mouse.  Others  have  found  this  same  linkage  in 
humans.  This  region  of  the  human  genome  often  exhibits  loss  of  heterozygosity  in  spontaneous  breast 
and  ovarian  cancers,  which  do  not  exhibit  mutations  in  BRCAl  or  BRCA2.  Future  experiments  will  be 
directed  at  understanding  how  GCN5  regulates  apoptosis,  how  this  regulation  might  be  tied  to 
tumorigenesis,  and  how,  or  if,  GCN5  is  important  to  mammary  development.  In  addition,  we  will 
genetically  probe  whether  PCAF  can  replace  GCN5  in  the  mouse.  These  experiments  are  important  for 
discerning  how  histone  acetyltransferases  contribute  to  hormone  responses  and  apoptosis,  and  thus  may 
yield  invaluable  information  for  the  prevention  or  treatment  of  hormone  responsive  breast  cancers. 
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The  yeast  transcriptional  adapter  GcnSp  serves  as  a  histone  acetyltransferase,  directly  linking  chromatin 
modification  to  transcriptional  regulation.  Two  human  homologs  of  GcnSp  have  been  reported  previously, 
hsGCNS  and  hsP/CAF  (p300/CREB  binding  protein  [CBP] -associated  factor).  While  hsGCNS  was  predicted  to 
be  close  to  the  size  of  the  yeast  acetyltransferase,  hsP/CAF  contained  an  additional  356  amino-terminal 
residues  of  unknown  function.  Surprisingly,  we  have  found  that  in  mouse,  both  the  GCN5  and  the  PICAF  genes 
encode  proteins  containing  this  extended  amino-terminal  domain.  Moreover,  while  a  shorter  version  of  GCN5 
might  be  generated  upon  alternative  or  incomplete  splicing  of  a  longer  transcript,  mRNAs  encoding  the  longer 
protein  are  much  more  prevalent  in  both  mouse  and  human  cells,  and  larger  proteins  are  detected  by 
GCNS-specific  antisera  in  both  mouse  and  human  cell  extracts.  Mouse  GCN5  (mmGCNS)  and  mmPjCAF  genes 
are  ubiquitously  expressed,  but  maximum  expression  levels  are  found  in  different,  complementary  sets  of 
tissues.  Both  mmP/CAF  and  mmGCNS  interact  with  CBP/p300.  Interestingly,  mmGCNS  maps  to  chromosome 
11  and  cosegregates  with  BRCAl^  and  mmPjCAF  maps  to  a  central  region  of  chromosome  17.  As  expected, 
recombinant  mmGCNS  and  mmP/CAF  both  exhibit  histone  acetyltransferase  activity  in  vitro  with  similar 
substrate  specificities.  However,  in  contrast  to  yeast  GcnSp  and  the  previously  reported  shorter  form  of 
hsGCNS,  mmGCNS  readily  acetylates  nucleosomal  substrates  as  well  as  free  core  histones.  Thus,  the  unique 
amino-terminal  domains  of  mammalian  P/CAF  and  GCNS  may  provide  additional  functions  important  to 
recognition  of  chromatin  substrates  and  the  regulation  of  gene  expression. 


Transcription  is  a  complex  process  requiring  the  coordinate 
action  of  multiple  basal  and  transactivating  proteins.  In  eu¬ 
karyotic  cells,  this  process  is  complicated  further  by  the  pack¬ 
aging  of  DNA  into  chromatin.  Nucleosomes  provide  the  fun¬ 
damental  repeat  unit  of  chromatin,  consisting  of  two  molecules 
of  each  of  the  four  core  histones  (H2A,  H2B,  H3,  and  H4)  and 
—146  bp  of  DNA  wound  in  almost  two  turns  around  the  exte¬ 
rior  of  the  histone  octamer  (37).  Individual  nucleosomes  as 
well  as  more  highly  folded  structures  are  generally  inhibitory  to 
the  initiation  of  transcription.  Alterations  in  nucleosomal 
structure  and  in  chromatin  packing  often  accompany  transcrip¬ 
tional  activation  (12). 

Posttranslational  acetylation  of  the  histones  has  long  been 
correlated  with  transcriptional  activation  (36,  39,  40).  Acetyla¬ 
tion  neutralizes  the  charge  associated  with  epsilon  amino 
groups  of  lysine  residues,  thereby  loosening  contacts  between 
the  histones  and  the  negatively  charged  DNA.  Histone  acety¬ 
lation  also  influences  compaction  of  nucleosomal  arrays,  yield¬ 
ing  less  condensed  chromatin  structures  (16).  Both  of  these 
effects  can  increase  transactivator  binding  to  nucleosomal 
DNA,  facilitating  transcriptional  activation. 

A  molecular  basis  for  the  linkage  between  histone  acetyla¬ 
tion  and  gene  activation  was  provided  by  the  discovery  that  the 
yeast  transcriptional  adapter  GcnSp  serves  as  the  catalytic  sub¬ 
unit  of  a  histone  acetyltransferase  type  A  activity  (5).  GcnSp  is 
associated  with  two  multisubunit  complexes  in  yeast,  which 
include  Ada  proteins  (Ada2p,  Ada3p,  and  AdaSp)  and/or  cer- 
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tain  Spt  proteins  (6,  14,  18,  24,  25,  31).  These  complexes  are 
required  for  transcriptional  activation  by  particular  transacti¬ 
vators,  including  heterologous  VP  16  derivatives  and  endoge¬ 
nous  Gcn4p  (3,  13,  24,  34).  Components  of  the  GcnSp- Adap 
complex  contact  both  transactivator  proteins  and  basal  tran¬ 
scription  proteins,  thus  providing  an  adapter  or  coactivator 
function,  in  addition  to  histone  acetyltransferase  activity  (2, 
34).  Association  with  both  Ada2p  and  acetyltransferase  activity 
is  required  for  GcnSp  function  in  vivo  (8,  38). 

Human  homologs  of  GCNS  have  been  cloned  based  on 
sequence  and  functional  similarities  of  their  predicted  prod¬ 
ucts  to  the  yeast  protein.  A  cDNA  predicted  to  encode  a 
protein  of  similar  size  and  with  overall  homology  to  yeast 
GcnSp  has  been  described  (7,  41),  A  human  ADA2  gene  has 
also  been  cloned,  indicating  a  conservation  of  adapter  and 
histone  acetyltransferase  functions  across  species  (7).  In  addi¬ 
tion,  a  cDNA  encoding  a  second,  larger  GcnS-related  protein 
that  possesses  unique  sequences  in  its  amino-terminal  half  has 
been  identified.  This  protein,  P/CAF  (p300/CREB  binding 
protein  [CBP]-associated  factor),  associates  with  two  highly 
related  proteins,  p300  and  CBP,  that  have  a  region  of  homol¬ 
ogy  with  ADA2  (41).  Interestingly,  p300  and  CBP  are  also 
histone  acetyltransferases  (1,  29).  Interactions  between  P/CAF 
and  p300  or  CBP  are  disrupted  by  the  viral  ElA  oncogene 
product,  and  this  disruption  is  required  for  cellular  transfor¬ 
mation  by  ElA  (41).  Proper  association  of  these  histone  acetyl¬ 
transferase  activities,  then,  is  extremely  important  for  normal 
cell  growth  (32). 

In  order  to  further  study  the  functions  of  histone  acetyltrans¬ 
ferases  in  the  growth  and  development  of  mammalian  cells,  we 
endeavored  to  isolate  sequences  encoding  mouse  GCNS 
(mmGCNS)  and  mmP/CAF.  To  our  surprise,  although  our 
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mmGCNS  exhibited  98%  identity  with  the  reported  human 
GCN5  (hsGCN5)  sequence,  the  mouse  cDNA  encoded  an 
extended  amino-terminal  domain  with  high  similarity  to  a  cor¬ 
responding  domain  in  P/CAF.  Upon  further  examination,  we 
found  that  the  reported  hsGCN5  cDNA  (41)  may  result  from 
an  incompletely  spliced  transcript,  and  that  a  more  prevalent 
transcript  exists  that  potentially  encodes  a  longer  hsGCNS 
protein  similar  to  that  encoded  by  the  mouse  cDNA  that  we 
isolated.  Moreover,  in  contrast  to  previous  reports  that  yeast 
and  human  GCN5  proteins  acetylate  only  free  core  histones, 
the  full-length  recombinant  mmGGNS  pfotein  containing  this 
extended  amino-terminal  region  acetylates  both  free  and  nu- 
cleosomal  histones  H3  and  H4.  These  results  suggest  that  this 
additional  domain  in  the  mammalian  GCN5  acetyltransferase 
facilitates  chromatin  recognition.  Interestingly,  P/CAF  and 
GCN5  are  expressed  in  inverse  ratios  in  many  mouse  tissues, 
indicating  that  these  proteins  may  serve  tissue-specific  func¬ 
tions. 

MATERIALS  AND  METHODS 

cDNA  library  screening.  Nested  PCR  with  degenerate  oligomers  and  a  mouse 
embryonic  cDNA  library  (13.5  days  postcoitum  [dpc])  as  the  template  was  per¬ 
formed  to  generate  a  fragment  of  the  mmGCNS  cDNA.  Oligomers  were  chosen 
from  regions  of  sequence  conserved  between  yeast  and  Tetrahymena,  which 
correspond  to  amino  acids  131  to  244  of  the  yeast  protein  sequence.,  A  single 
band  of  123  bp  was  generated  and  cloned  into  pBluescript  (Stratagei]^e).  Se¬ 
quencing  revealed  80%  nucleotide  identity  and  94%  idendty  at  the  amitio  acid 
level  to  the  reported  hsGCNS.  This?  PCR  product  and  human  EST  clones 
(IMAGE  clone  no.  243927)  with  siniilarity  to  GCN5  were  used  together  to 
screen  a  cDNA  library  under  conditions  of  low  stringency  as  previously  described 
(11).  Clones  were  plaque  purified  and  rescued  as  per  the  manufacturer’s  proto¬ 
col.  Sequencing  revealed  two  types  of  clones)  some  with  similarity  to  hsGCNS 
and  some  with  similarity  to  hsPfCAF.  All  of  the  P/CAF  clones  contained  only  a 
short  piece  of  P/CAF,  and  rescreening  of  the  library  failed  to  isolate  any  longer 
clones.  Therefore,  an  oligomer  corresponding  to  the  5 '-most  sequence  of  mmP/ 
CAF  was  used  to  screen  a  10.5-dpc  embryonic  mouse  plasmid  library  with 
GeneTrapper  technology  (Gibco  BRL).  Additional  clones,  corresponding  to 
full-length  P/CAF  sequences,  were  isolated  according  to  the  manufacturer’s 
protocol. 

Genomic  library  screening.  A  mouse  genomic  library,  Lambda  FIXII  (Strat- 
agene),  was  screened  by  using  a  mbtture  of  a  5'  fragment  of  the  mmGCNS  cDNA 
and  a  5'  fragment  of  the  mmP/CAF  cDNA.  Positive  plaques  were  picked  and 
subjected  to  secondary  screening.  Phage  DNA  was  prepared  from  positive 
plaques  by  standard  procedures.  Genomic  inserts  were  released  from  phage 
DNA  by  Notl  digestion  and  subsequently  subcloned  into  Bluescript  KS(+) 
(Stratagene). 

Sequencing  analysis.  DNA  sequencing  was  performed  by  using  the  Thermo- 
Sequenase  radiolabeled  terminator  cycle  sequencing  kit  (Amersham  Life  Sci¬ 
ence).  Sequencing  amplification  conditions  were  94“C  for  30  s,  55°C  for  30  s,  and 
72°C  for  1  min  for  40  cycles.  Alternatively,  automated  sequencing  was  carried  out 
by  the  Sequencing  Core  Facility  at  the  M.  D.  Anderson  Cancer  Center. 

Sequence  alignment.  Published  sequences  were  obtained  by  searching  the 
GenBank,  PIR-Protein,  and  SWISS-PROT  databases.  Sequence  alignment  was 
carried  out  with  the  Genetics  Computer  Group  (GCG)  (Wisconsin  Package 
version  9.1;  GCG,  Madison,  Wis.)  Pileup  program.  Percent  identity  between  two 
proteins  was  calculated  by  using  the  GCG  Bestfit  program. 

Linkage  analysis  mapping.  Restriction  fragment  length  polymorphisms  for 
mmGCNS  or  mmP/CAF  in  C57BL/6J  and  SPRET/Ei  subspecies  were  deter¬ 
mined  by  using  genomic  DNA.  purchased  from  the  Jackson  Laboratory.  The 
Jackson  Laboratory  interspecific  backcross  panel  (C57BL/6JEi  x  SPRET/Ei)Fi 
X  SPRET/Ei,  known  as  Jackson  BSS  (33),  was  then  used  to  map  the  chromo¬ 
somal  locations  of  the  mmGCNS  and  mmP/CAF  genes.  Predigested  panels  (Bglll 
digestion  for  P/CAF  or  Xbal  digestion  for  GCNS)  were  analyzed  by  Southern 
blotting  with  a  GCNS  or  P/CAF  intronic  probe.  Typing  results  were  processed  via 
the  Jackson  Laboratory  database  analysis  (see  http://www.jax.org/resources/doc- 
uments/cmdata  for  raw  data). 

RT-PCR.  Isolation  of  total  RNA  from  various  mouse  tissues  was  performed  as 
described  previously  (10).  RNA  was  digested  with  RNase  A-free  DNase  I  (Am- 
bion)  for  30  min  at  37°C.  Reverse  transcriptase  PCR  (RT-PCR)  was  performed 
with  an  RT-PCR  kit  (Perkin-Elmer)  accordinjg  to  manufacturer’s  protocols. 
Reverse  transcription  was  carried  out  at  42®C  for  15  min,  followed  by  heating  at 
95°C  for  5  min.  PCRs  were  carried  out  at  95°C  for  60  s  and  60°C  for  60  s  for  35 
cycles  as  suggested  by  the  manufacturer.  Primer  A  (see  Fig.  3B  for  sequence 
location)  for  RT-PCR  is  CTGGTGCCTGAGAAGAGGAC;  primer  B  (see  Fig. 
3B)  is  CTCCGAAGGTGGCATGGTGAAG. 

RNA  analysis.  Total  RNA  from  adult  mouse  tissues  or  whole  embryos  (13.5 
dpc)  was  extracted  as  described  previously  (10).  RNAs  were  electrophoresed  on 


a  1.1%  agarose  gel  containing  formaldehyde,  along  with  RNA  molecular  size 
markers  (Gibco  BRL).  RNA  was  transferred  to  a  GeneScreen  Plus  membrane 
(NEN  Life  Science)  in  lOx  SSC  (IX  SSC  is  0.15  M  NaCl  plus  0.015  M  sodium 
citrate).  Hybridization  was  carried  out  with  mmGCNS-  and  mmP/C4F-specific 
probes. 

GCNS  protein  analysis.  Mouse  embryos  (12.5  dpc)  were  homogenized  in 
radioimmunoprecipitation  assay  buffer  (lx  phosphate-buffered  saline,  1%  Non- 
idet  P-40, 0.5%  sodium  deoxycholate,  0.1%  sodium  dodecyl  sulfate  [SDS],  100  p,g 
of  phenylmethylsulfonyl  fluoride  per  ml,  1  pg  of  aprotinin  per  ml)  and  then 
centrifuged  at  15,000  X  g  for  20  min  at  4°C.  Supernatant  was  collected  for 
Western  blotting,  and  GCNS  was  immunoprecipitated  with  the  polyclonal  hs¬ 
GCNS  antibody  (generously  provided  by  Shelley  Berger,  Wistar  Institute)  by  the 
protocol  of  Santa  Cruz  Biotech,  Inc.  HeLa  cell  nuclear  extract  was  kindly  pro¬ 
vided  by  Warren  Liao  and  Yongsheng  Ren  (M.D.  Anderson  Cancer  Center). 

Cloning  and  expression  of  full-length  mmGCNS  and  mmP/CAF.  Comparison 
of  the  mmGCNS  genomic  and  cDNA  clones  revealed  that  the  isolated  cDNA 
lacks  the  sequences  encoding  the  first  74  amino  acids.  These  sequences  (which 
lack  introns)  were  excised  from  the  GCNS  genomic  clone  by  Ncol  and  teHII 
digestion  and  inserted  into  the  appropriate  position  of  the  cDNA  clone  to 
generate  a  full-length  mmGCNS  cDNA,  as  verified  by  DNA  sequencing.  Full- 
length  mmGCNS  was  subcloned  into  the  Ncol  and  Hindlll  sites  of  the  pRSETB 
vectors  (Invitrogen),  such  that  an  N-terminal  His^  tag  was  fused  in  frame  with  the 
coding  region.  Similarly,  full-length  mmP/CAF  was  subcloned  into  the  BamYil 
and  Kpnl  sites  of  the  pRSETB  vector.  Hisg-tagged  proteins  were  induced  in 
BL21-DE3  bacterial  cells  by  addition  of  1  mM  IPTG  (isopropyl-p-D-thiogalac- 
topyranoside).  Recombinant  protein  was  purified  by  using  nickel-nitrilotriacetic 
acid  resin  (Qiagen)  according  to  the  manufacturer’s  protocol.  Purified  recombi¬ 
nant  proteins  were  verified  by  Western  blot  analysis  with  an  antibody  specific  to 
the  HiSs  tag  (Clontech). 

Acetyltransferase  assays.  Acetyltransferase  assays  were  performed  as  previ¬ 
ously  described  (4,  5).  HeLa  cell  mononucleosomes  or  core  histones  were  the 
kind  gift  of  Jerry  Workman,  and  the  cysteine-linked  peptides  (corresponding  to 
amino  acids  1  to  20  of  H3  or  to  this  same  region  with  substitution  of  acetyl-lysine 
at  positions  9  and  14)  were  the  gift  of  C.  David  Allis.  Calf  thymus  histones  were 
purchased  from  Worthington  Biochemical  Corporation  (Freehold,  N.J.).  Acet¬ 
ylation  assays  were  performed  in  10-  to  30-p,l  volumes  with  either  10  jxg  of 
histones  or  the  indicated  amount  of  synthetic  peptide.  Following  incubation  at 
30“C  for  30  min,  an  aliquot  of  each  reaction  mixture  was  processed  for  liquid 
scintillation  counting  (P81  filter  assay)  as  described  by  Brownell  et  al.  (5),  and 
when  appropriate,  another  aliquot  was  electrophoresed  on  an  SDS-22%  poly¬ 
acrylamide  gel  and  histones  were  visualized  by  Coomassie  blue  staining  and 
autoradiography. 

GST  fusion  protein  interaction  assays.  Glutathione  5-transferase  (GST)- 
CBP/p300  interaction  assays  were  performed  as  described  by  Yang  et  al.  (41) 
except  that  crude  bacterial  lysates  containing  His-tagged  recombinant  P/CAF, 
GCN5,  or  HIRA  were  used  and  the  interactions  were  detected  by  Western 
blotting  with  the  His  tag  antibody. 


RESULTS 

Cloning  of  mmGCNS.  In  order  to  study  the  function  of 
acetyltransferases  in  a  mammalian  system,  we  endeavored  to 
clone  mouse  GCNS  homologs.  First  we  generated  a  fragment 
of  the  mmGCNS  cDNA  using  a  nested  PCR  strategy  employ¬ 
ing  degenerate  primers  homologous  to  conserved  regions  of 
the  yeast  GCNS  and  the  Tetrahymena  p55  genes.  To  further 
enhance  the  probability  of  identifying  GCA/5-related  se¬ 
quences,  this  fragment  was  used  together  with  a  human  GCNS 
EST  to  screen  a  13.5-dpc  mouse  embryonic  cDNA  library 
under  conditions  of  low  stringency  (11).  Multiple  positive 
clones  were  identified,  and  upon  sequencing,  these  were  found 
to  contain  open  reading  frames  predicted  to  encode  proteins 
with  significant  homology  to  either  hsGCNS  or  hsP/CAF  (Fig, 
1). 

One  cDNA  clone  contained  an  open  reading  frame  encod¬ 
ing  756  amino  acids,  and  the  C-terminal  portion  of  this  pre¬ 
dicted  amino  acid  sequence  exhibited  98%  identity  with  the 
reported  hsGCNS  sequence,  but  only  71%  homology  to  the 
hsP/CAF  sequence,  over  the  length  of  the  predicted  proteins 
(Fig.  2A).  We  tentatively  concluded  that  this  cDNA  clone 
likely  contains  the  mmGCNS  gene,  as  confirmed  below. 

We  next  used  a  fragment  from  the  5'  end  of  this  clone  to 
screen  a  library  of  mouse  genomic  sequences.  Three  different 
clones  were  isolated,  and  restriction  analysis  and  sequencing 
indicated  that  all  three  clones  harbored  the  entire  mmGCNS 
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FIG.  1.  Alignment  of  the  mmGCNS,  mmP/CAF,  and  reported  hsGCNS  amino  acid  sequences.  Identical  amino  acids  are  shaded.  Amino  acid  deletions  are  indicated 
with  a  dotted  line.  The  locations  of  the  histone  acetyltransferase  (HAT)/acetyl  coenzyme  A  binding  regions  (20,  38)  and  the  bromodomain  motif  (17)  are  indicated. 
The  full  bromodomain  likely  encompasses  amino  acids  363  to  472  of  hsGCNS  (19). 
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FIG.  2.  Comparison  of  GCN5  and  P/CAF  sequences  across  species.  (A)  Schematic  comparisons  of  mmGCNS  and  mmP/CAF  sequences  with  GCN5  and  P/CAF 
proteins  from  other  species.  Published  sequences  were  obtained  by  searching  the  PIR-Protein  and  SWISS-PROT  databases.  Positions  of  the  putative  catalytic  domains 
and  the  bromodomains  are  indicated  above  the  diagram.  Percent  identities  between  proteins  are  indicated  on  the  right.  yGcnSp,  yeast  GcnSp;  HAT,  histone 
acetyltransferase;  aa,  amino  acids.  The  dotted  box  indicates  the  existence  of  a  predicted  extended  amino-terminal  region  in  the  hsGCNS  protein.  (B)  Comparison  of 
the  Kozak  consensus  sequence  for  translation  start  sites,  the  predicted  mmGCN5  translation  start  sites,  and  the  previously  reported  hsGCN5  translation  start  site  (41). 
The  underlined  AUG  is  the  codon  for  the  initiator  methionine.  The  A  of  the  AUG  is  designated  -1-1.  In  the  consensus  sequence,  the  nucleotides  at  positions  -3  and 
+4  have  the  greatest  impact  on  translation  efficiency;  97%  of  vertebrate  mRNAs  have  a  purine  (A  or  G,  preferably  A)  in  position  -3,  and  46%  have  a  G  in  position 
+4  (21,  22). 


gene.  Comparison  of  the  genomic  and  cDNA  clones  of 
mmGCNS  revealed  that  the  cDNA  clone  isolated  as  described 
above  actually  lacked  the  first  74  amino-terminal  codons  and 
that  the  mmGCNS  gene  is  divided  into  19  exons  and  contains 
relatively  small  (85-bp  to  1-kb)  introns  (Fig.  3A).  We  inserted 
sequences  from  the  genomic  clone  containing  the  missing  ami¬ 
no-terminal  codons  into  the  cDNA  clone  to  generate  a  full- 
length  (encoding  830  amino  acids)  recombinant  mmGCNS 
cDNA. 

The  two  previously  reported  hsGCNS  sequences  differ  in  the 
position  of  the  initiating  methionine,  such  that  one  reported 
sequence  contains  49  additional  amino-terminal  amino  acids 
relative  to  the  other  (7, 41).  The  mmGCNS  open  reading  frame 
also  encodes  these  additional  amino  acids,  but  the  open  read¬ 
ing  frame  is  further  extended  for  some  distance  upstream  of 
these  sequences,  potentially  encoding  356  additional  amino 
acids.  The  context  of  the  predicted  translation  initiation  site  in 
this  extended  region  of  mmGCNS  matches  well  the  Kozak 
consensus  sequence  (Fig.  2B)  (21,  22).  Moreover,  the  amino 
acids  in  this  amino-terminal  extension  exhibit  more  than  66% 
identity  to  sequences  in  the  corresponding  regions  of  both 


mouse  (see  below)  and  human  P/CAF,  and  the  length  of  this 
extended  region  is  similar  to  that  of  the  P/CAF  proteins.  These 
data  indicate  that  mmGCNS  encodes  a  protein  that  is  very 
homologous  to  yeast  GcnSp  and  is  almost  identical  to  the 
previously  reported  hsGCNS  but  that  contains  an  extended 
N-terminal  domain  homologous  to  P/CAF  in  both  size  and 
sequence. 

incomplete  splicing  might  yield  a  shorter  GCN5  protein  in 
mouse  and  human  cells.  We  were  interested  in  determining 
the  basis  of  the  incongruity  in  size  between  mmGCNS  and  the 
reported  human  cDNA  Inspection  of  the  mmGCNS  genomic 
sequence  revealed  the  presence  of  an  intron  (intron  6  in  Fig. 
3 A)  10  bp  upstream  of  the  previously  reported  upstream-most 
hsGCNS  translation  initiation  site  (41).  Sequences  highly  sim¬ 
ilar  (91%  identical)  to  these  intronic  sequences  are  also 
present  in  the  predicted  S'  untranslated  region  of  the  reported 
hsGCNS  cDNA  but  are  absent  in  the  mouse  cDNA  we  isolated 
as  described  above.  These  comparisons  suggest  either  that  the 
mouse  and  human  GCNS  genes  are  subject  to  differential  splic¬ 
ing  events,  in  which  this  intron  is  either  removed  (mouse)  or 
retained  (human),  or  that  the  previously  identified  human 
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FIG.  3.  (A)  Genomic  structure  of  mmGCNS.  Exons  in  the  mmGCNS  gene  are  shown  in  boxes,  while  introns  are  represented  by  the  intervening  lines.  The  thick  line 
indicates  the  intron  that  is  retained  in  some  alternatively  spliced  variants  and  which  is  homologous  with  sequences  found  in  the  previously  reported  5'  untranslated 
region  of  the  hsGCN5  cDNA.  The  asterisks  indicate  in-frame  stop  codons  that  would  prevent  translation  of  the  full-length  protein.  Exon  7  is  the  exon  that  is  skipped 
in  the  mouse  cDNA.  Exon  8  is  the  first  coding  exon  in  the  reported  hsGCNS  cDNA.  The  positions  of  the  translation  start  codon  (ATG)  and  the  termination  codon 
(TAG)  are  indicated.  (B)  Coexistence  of  multiple  forms  of  mammalian  GCN5  transcripts.  Left  panel,  DNase  I-treated  RNAs  from  mouse  kidney,  ovary,  and  embryo 
and  human  HeLa  and  hepatoma  cell  lines  were  RT-PCR  amplified  by  using  primers  A  and  B  shown  in  the  diagram  on  the  right.  RT-PCR  products  were  resolved  on 
an  ethidium  bromide  (Et-Br)-stained  agarose  gel.  Mouse  genomic  DNA  was  also  amplified  under  the  same  conditions.  A  prevalent  product  corresponding  to  the  size 
of  the  mouse  cDNA  (without  introns  6  and  7)  was  amplified  from  all  of  the  RNA  samples,  while  other,  larger  products  corresponding  to  the  size  of  the  reported  human 
cDNA  (containing  intron  6  but  not  intron  7)  were  barely  detected.  Middle  panel,  the  RT-PCR  products  were  transferred  to  nylon  membrane  and  hybridized  with 
mmGCNS  cDNA  sequences.  Right  panel,  the  same  blot  from  the  middle  panel  was  stripped  and  rehybridized  with  a  probe  (probe  A)  specific  to  the  conserved  intron 
6.  (C)  The  RT-PCR  products  described  above  were  gel  purified  and  then  amplified  by  PCR  with  a  nested  pair  of  primers.  The  PCR  products  were  then  subjected  to 
DNA  sequencing.  The  nucleotide  sequence  of  the  317-bp  fragment  is  shown.  The  smaller  (292-bp)  fragment  has  the  same  sequence  as  the  larger  fragment  except  that 
it  lacks  exon  7  sequences.  Exons  are  boxed,  and  introns  are  numbered  underneath.  In-frame  stop  codons  are  in  boldface  and  marked  by  asterisks.  The  mmGCNS  reading 
frame  separated  by  intron  6  and  exon  7  is  shown.  The  initiator  methionine  codon  for  the  reported  hsGCNS  (41)  protein  is  in  boldface  and  underlined. 
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FIG.  4.  Detection  of  both  long  and  short  GCN5  proteins.  (A)  Left  panel,  protein  extracts  were  prepared  from  U20S  cells  or  HeLa  cell  nuclei  and  probed  with 
polyclonal  antibodies  to  hsP/CAF  or  hsGCN5,  as  indicated.  Right  panel,  protein  extracts  were  prepared  from  12.5-dpc  mouse  embryos  or  HeLa  cells  and  probed  with 
the  hsGCNS  antibody.  (B)  GCN5  proteins  were  immunoprecipitated  (IP)  from  12.5-dpc  mouse  embryos  and  then  probed  with  hsGCN5  antibody.  An  unrelated  HIRA 
polyclonal  antibody  was  used  as  a  negative  control.  ^ 
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cDNA  sequence  is  incomplete.  Interestingly,  a  conserved,  in¬ 
frame  stop  codon  is  found  near  the  beginning  of  intron  6,  and 
retention  of  this  intron  would  prevent  translation  of  the  larger 
protein  in  both  mouse  and  human  cells,  perhaps  yielding  a 
smaller  protein  with  a  size  corresponding  to  that  previously 
predicted  for  hsGCNS. 

To  investigate  the  possibility  of  alternative  (or  incomplete) 
splicing  of  mouse  and  human  GCN5  transcripts,  we  performed 
RT-PCR  on  total  RNA  isolated  from  human  HeLa  cells,  hu¬ 
man  hepatoma  cells,  mouse  kidney,  mouse  ovary,  and  a  13.5- 
dpc  mouse  embryo.  All  RNAs  were  treated  with  RNase-free 
DNase  I  before  RT-PCR  to  remove  any  genomic  DNA  from 
the  samples.  An  mmGCN5  genomic  DNA  clone  was  used  in  a 
separate  reaction,  as  a  positive  control  for  the  presence  of  the 
intron  sequences.  Two  primers  corresponding  to  conserved 
sequences  in  exons  6  and  8,  which  flank  introns  6  and  7  (Fig. 
3A  and  B),  were  used  for  the  amplification.  The  RT-PCR 
products  were  separated  on  an  agarose  gel,  transferred  to  a 
membrane,  and  then  probed  sequentially  with  mmGCNS 
cDNA  sequences  or  intron  6  sequences. 

A  predominant  RT-PCR  product  of  a  size  corresponding  to 
the  spliced  cDNA  (lacking  the  intron)  was  amplified  from 
mouse  embryonic,  kidney,  and  ovarian  RNAs  (lower  band  in 
Fig.  3B).  As  expected,  this  product  was  significantly  smaller 
(126  bp)  than  the  amplification  product  from  the  genomic 
DNA  (about  1  kb),  which  contains  introns  6  and  7.  This  small 
product  hybridized  to  the  mmGCNS  cDNA  sequences  but  not 
to  the  intron  6  probe,  consistent  with  the  removal  of  these 
intronic  sequences  by  splicing.  In  contrast,  two  less  abundant, 
closely  spaced  bands  were  detected  by  both  the  cDNA  and  the 
intron  6  probes.  An  intron  7  probe  hybridized  only  to  the 
genomic  DNA  but  failed  to  detect  any  of  the  RT-PCR  prod¬ 
ucts  (data  not  shown),  suggesting  that  intron  7  had  been  re¬ 
moved  in  all  of  the  transcripts.  Sequencing  of  the  larger,  closely 
spaced  RT-PCR  products  revealed  that  they  represent  two 
alternatively  spliced  variants  of  mmGCNS  (Fig.  3C).  Both  of 
these  variants  retained  intron  6,  but  one  also  contained  a  novel 


25-bp  exon  (exon  7)  located  between  introns  6  and  7.  Intron  7 
was  removed  from  both  of  these  alternatively  spliced  products, 
bringing  the  stop  codons  in  intron  6  to  a  position  just  upstream 
of  the  ATG  sequence  corresponding  to  the  previously  pre¬ 
dicted  translation  start  site  of  hsGCNS.  Together  these  data 
indicate  that  the  predominant  form  of  the  mouse  cDNA  is 
completely  spliced,  lacks  these  stop  codons,  and  therefore  is 
predicted  to  encode  the  longer  version  of  GCN5.  However,  the 
two  minor  RT-PCR  products  that  we  observed  might  encode 
shorter  GCN5  proteins,  consisting  of  the  amino-terminal, 
P/CAF-like  domain  in  isolation  or  of  the  C- terminal  domain, 
which  is  most  similar  to  yeast  GCN5. 

RT-PCR  of  total  RNA  from  human  cells  revealed  a  similar 
mixture  of  completely  and  incompletely  spliced  RNAs.  For 
example,  two  RT-PCR  products  were  generated  from  the  hu¬ 
man  HeLa  cell  and  hepatoma  cell  RNAs.  The  size  of  the  more 
abundant,  smaller  product  again  is  consistent  with  a  spliced 
cDNA  lacking  sequences  homologous  to  the  mouse  intron  6 
and  exon  7,  and  this  product  hybridizes  only  to  cDNA  se¬ 
quences.  The  less  abundant,  larger  product  hybridizes  to  both 
intron  and  exon  sequences  (Fig.  3B,  middle  panel).  We  suggest 
that  the  longer  product  likely  corresponds  to  the  hsGCNS 
cDNA  sequences  previously  reported,  whereas  the  more  prev¬ 
alent,  shorter  form  represents  a  spliced  product  predicted  to 
encode  a  longer  protein  analogous  to  that  encoded  by  the 
mouse  cDNA  isolated  as  described  above. 

Long  GCN5  proteins  are  present  in  both  human  and  mouse 
cells.  To  identify  the  size  of  the  native  mammalian  GCN5 
protein(s),  total  cell  extracts  prepared  from  a  12.5-dpc  mouse 
embryo  or  human  HeLa  cells  were  probed  with  a  polyclonal 
serum  raised  against  the  previously  described  hsGCN5  (gen¬ 
erously  provided  by  Shelley  Berger,  Wistar  Institute).  The 
hsGCN5-specific  antiserum  detected  a  98-kDa  protein  in  the 
HeLa  cell  nuclear  extracts,  consistent  with  the  predicted  size  of 
the  full-length  GCN5  protein  containing  the  extended  amino- 
terminal  region  (Fig.  4A,  left  panel).  To  ensure  that  this  band 
corresponded  to  mmGCN5  and  that  the  hsGCN5  antibody  did 
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not  cross-react  with  P/CAF,  we  compared  the  relative  signals 
obtained  with  the  hsGCN5  antibody  and  a  P/CAF  antibody 
(generously  provided  by  Yoshihiro  Nakatani,  National  Insti¬ 
tutes  of  Health)  with  extracts  from  U20S  cells  or  HeLa  cells. 
The  P/CAF  antibody  recognized  a  single  band  in  the  U20S 
extract,  consistent  with  previous  reports  that  P/CAF  is  well 
expressed  in  these  cells  (41),  and  in  the  HeLa  cell  nuclear 
extract.  The  hsGCNS  antibody,  however,  did  not  recognize  any 
proteins  of  a  similar  size  in  either  extract  but  did  recognize  a 
prominent  band  of  ~98  kDa  in  the  HeLa  cell  nuclear  extract. 
Therefore,  the  hsGCN5  antibody  does  not  appear  to  cross- 
react  significantly  with  PCAF,  and  we  conclude  that  the  98- 
kDa  protein  recognized  by  this  antibody  in  HeLa  cell  extracts 
is  GCN5. 

The  hsGCN5  antibody  also  recognized  a  faint  60-kDa  band 
(lower  arrow  in  right  panel  of  Fig.  4A)  in  the  HeLa  cell  ex¬ 
tracts,  close  to  the  predicted  size  of  the  shorter  GCN5  protein 
described  previously  (38)  and  above.  Thus,  both  the  long  and 
short  forms  of  GCN5  appear  to  be  expressed  in  these  cells,  but 
the  longer  form  appears  to  be  predominant.  Interestingly,  the 
long  form  of  GCN5  was  the  only  form  detected  in  mouse 
embryo  extracts.  The  expression  of  GCN5  protein  in  the  em¬ 
bryonic  extracts  is  consistent  with  high  levels  of  GCN5-specific 
RNA  detected  in  these  tissues  (see  Fig.  5).  Moreover,  since 
only  very  low  levels  of  P/CAF  RNA  were  detected  at  this  (or 
any)  stage  of  mouse  embryogenesis  (data  not  shown  and  see 
Fig.  5),  these  data  further  support  our  conclusion  that  the 
hsGCNS  antibody  recognizes  mmGCN5  rather  than  mmP/ 
CAF.  Neither  the  long  nor  the  short  form  of  GCN5  was  de¬ 
tected  by  control,  preimmune  serum  in  either  the  mouse  or 
human  extracts  (data  not  shown). 

We  also  used  the  anti-hsGCN5  serum  to  immunoprecipitate 
GCN5  proteins  from  the  mouse  embryo  extract.  Precipitated 
proteins  were  then  detected  by  Western  blotting  with  the  same 
serum.  Again,  a  98-kDa  protein  was  detected  by  the  hsGCN5 
antibody  but  not  by  a  control  rabbit  serum  (Fig,  4B).  Unfor¬ 
tunately,  the  shorter  form  of  GCN5,  if  it  was  present,  would 
comigrate  with  the  immunoglobulin  G  band  and  thus  could  not 
be  detected  by  this  approach.  Nevertheless,  these  experiments 
confirm  the  presence  of  the  longer  GCN5  protein  in  mouse 
embryos. 

Cloning  of  mmPICAF.  A  second  GCAJ-related  cDNA  clone 
that  contained  a  high  degree  of  similarity  to  hsPjCAF  was 
isolated  in  our  screen  of  the  mouse  cDNA  library.  Since  all 
initial  clones  appeared  to  be  incomplete,  containing  an  867-bp 
fragment  of  the  cDNA  (relative  to  the  human  sequence),  a 
second  library  was  screened  by  using  GeneTrapper  technology. 
Multiple  full-length  cDNAs  containing  an  open  reading  frame 
predicted  to  encode  813  amino  acids  were  obtained.  This  open 
reading  frame  exhibited  93%  identity  to  the  hsPICAF  cDNA 
sequence  but  only  75%  identity  to  the  reported  hsGCNS 
cDNA  sequence  (41).  We  therefore  designated  this  clone 
mmPICAF.  Both  the  mmGCN5  and  the  mmP/CAF  sequences 
possess  predicted  catalytic  domains  and  bromodomains  iden¬ 
tified  in  a  number  of  recently  identified  histone  acetyltrans- 
ferases,  including  several  highly  conserved  amino  acids  near 
the  putative  catal)^ic  center  (Fig.  1). 

Using  a  fragment  from  the  5'  region  of  the  mmPICAF  cDNA 
as  a  probe,  we  identified  multiple  clones  from  a  library  of 
mouse  genomic  sequences  that  contained  PICAF  sequences. 
Four  of  these  contained  different  portions  of  the  cDNA  se¬ 
quence,  These  clones  indicate  that  in  contrast  to  the  mmGCNS 
gene,  which  contains  small  introns  (a  few  hundred  base  pairs 
each),  the  mmPICAF  gene  contains  very  large  introns  (16  to  20 
kb).  Because  of  these  large  introns,  we  have  not  completed 
cloning  of  mmPICAF  genomic  sequences. 


P/CAF 

GCN5 

18S  rRNA 


FIG.  5.  Ubiquitous  and  complementary  expression  of  mmGCN5  and  mmP/ 
CAF.  (Top  panel)  Total  RNA  was  isolated  from  various  mouse  tissues  and 
embryos  as  indicated.  Northern  blot  hybridization  was  performed  with  a  mixture 
of  mmGCN5  and  mmPICAF  cDNA  probes.  Two  transcripts  were  detected  and 
are  indicated  by  arrows  on  the  right.  The  identities  of  the  transcripts  were 
confirmed  by  Northern  blot  hybridization  with  single  GCN5  or  PfCAF  probes 
(data  not  shown).  Positions  of  RNA  molecular  size  standards  (in  kilobases)  are 
shown  on  the  left.  (Bottom  panel)  Ethidium  bromide-stained  gel  showing  18S 
rRNAs. 


Interestingly,  several  clones  identified  in  our  genomic 
screens  apparently  contain  a  PICAF  pseudogene.  No  intronic 
sequences  are  present  in  these  clones,  and  several  base  substi¬ 
tutions,  relative  to  the  cDNA  sequence,  are  scattered  through¬ 
out  the  predicted  coding  region  of  the  pseudogene,  RT-PCR 
analysis  indicates  that  the  pseudogene  is  not  expressed  in  sev¬ 
eral  mouse  tissues  examined,  including  brain,  eye,  heart,  lung, 
liver,  kidney,  thymus,  spleen,  fat,  diaphragm,  small  intestine, 
ovary,  testis,  or  a  13.5-dpc  embryo  (data  not  shown). 

Ubiquitous  but  complementary  expression  of  mmGCNS  and 
mmPICAF.  To  examine  and  compare  the  expression  of 
mmGCNS  and  mmPICAF,  total  RNA  was  extracted  from  var¬ 
ious  mouse  tissues,  subjected  to  denaturing  electrophoresis, 
transferred  to  a  membrane,  and  then  probed  with  mmGCNS- 
or  mmPICAF-specific  sequences. 

A  single  transcript  of  3.3  kb  was  detected  in  all  tissues  with 
the  GCNS  probe,  consistent  with  size  of  the  cDNA  clone  we 
isolated.  Similarly,  a  single,  ubiquitous  transcript  was  detected 
with  the  PICAF  probe,  and  the  size  of  this  RNA,  4.4  kb,  is 
similar  to  that  of  the  PICAF  cDNA  that  we  isolated.  Interest¬ 
ingly,  the  PICAF  RNA  always  exhibited  a  broader  banding 
pattern  than  did  the  GCNS  RNA.  These  two  RNAs  were 
clearly  distinguished  from  one  another  when  probed  on  the 
same  blot,  and  a  differential  pattern  of  expression  was  detected 
(Fig.  5),  For  example,  the  ratio  of  mmGCNS  to  mmPICAF 
expression  is  higher  in  brain,  thymus,  spleen,  testis,  and  13.5- 
dpc  embryonic  tissue,  while  this  ratio  is  much  lower  in  heart, 
liver,  kidney,  and  skeletal  muscle.  Western  blot  analysis  of 
GCN5  protein  levels  (with  the  polyclonal  antiserum  to  hs- 
GCN5  described  above)  in  various  mouse  tissues  confirmed 
the  general  pattern  of  expression  indicated  by  this  RNA  anal¬ 
ysis  (data  not  shown). 

Chromosomal  locations  of  the  mmGCNS  and  mmPICAF 
genes.  The  chromosomal  location  of  the  mmGCNS  gene  was 
mapped  by  standard  linkage  analysis  with  the  Jackson  Labo¬ 
ratory  interspecific  backcross  panel  (C57BL/6Jei  X  SPRET/ 
Ei)Fi  X  SPRET/Ei,  also  known  as  Jackson  BSS  (33). 
mmGCNS  mapped  cleanly  to  a  distal  region  on  chromosome 
11  and  cosegregated  tightly  with  BRCAl,  as  well  as  with  a 
number  of  other  genes  previously  mapped  to  that  locus  (data 
not  shown,  but  raw  data  from  the  Jackson  Laboratory  are 
available  at  http://www.jax.org/resources/documents/cmdata). 
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FIG.  6.  Acetylation  of  histone  H3  synthetic  peptides  by  nimGCN5  and  mmP/CAF.  (A  and  B)  Results  of  acetyltransferase  assays  with  recombinant  full-length 
mmGCN5  and  synthetic  peptides  corresponding  to  the  amino-terminal  tail  of  histone  H3.  (C  and  D)  Results  of  peptide  assays  with  recombinant  mmP/CAF.  Peptides 
were  either  unacetylated  (un)  or  synthesized  with  acetyl-lysine  residues  at  either  K9  and  K18  (di9,18)  or  K9  and  K14  (di9,14)  in  H3.  Vector  indicates  assay  of  a  control 
extract,  made  from  bacteria  transformed  by  the  recombinant  vector  without  an  acetyltransferase  insert,  subjected  to  the  His  tag  purification  procedure. 


Interestingly,  the  hsGCN5  gene  was  recently  mapped  by  fluo¬ 
rescent  in  situ  hybridization  analysis  to  a  syntenic  region  of 
human  chromosome  17  (9)  and  was  also  found  to  cosegregate 
with  human  BRCAl. 

The  location  of  mmPjCAF  was  mapped  in  a  similar  fashion, 
using  the  same  backcross  panel.  In  this  case  we  used  a  probe 
specific  for  intronic  sequences  to  ensure  that  we  mapped  the 
authentic  mmPICAF  gene  and  not  the  PICAF  pseudogene. 
This  analysis  indicated  that  mmPICAF  is  located  32  centimor- 
gans  from  the  centromere  of  mouse  chromosome  17  and  that 
it  cosegregates  with  the  DNA  marker  D17Bir8  (see  www  ad¬ 
dress  above). 

mmGCNS  encodes  a  histone  acetyltransferase  with  sub¬ 
strate  specificity  similar  to  that  of  P/CAF.  The  high  degree  of 
homology  between  the  mouse,  human,  and  yeast  GCN5  pro¬ 
teins  strongly  predicts  that  mmGCNS  and  mmP/CAF  will  ex¬ 
hibit  histone  acetyltransferase  activity.  We  confirmed  this  ini¬ 
tially  by  examining  the  activities  of  the  isolated,  conserved 
acetylase  domains  of  mmGCNS  and  mmP/CAF,  expressed  as 
recombinant  proteins  in  Escherichia  coli.  As  expected,  this 
domain  of  mmGCNS  was  quite  active  as  a  histone  acetylase, 
and  it  preferentially  acetylated  free  (nonnucleosomal)  histone 
H3,  and  to  a  lesser  degree  H4,  as  does  yeast  GcnSp  (23)  and 
the  previously  reported  form  of  the  hsGCNS  protein  (41), 
Full-length  mmGCNS  and  mmP/CAF  recombinant  proteins 
(also  expressed  in  bacteria)  exhibited  this  same  substrate  spec¬ 
ificity  towards  free  histones  (Fig.  6  and  data  not  shown). 


To  determine  which  residues  of  histone  H3  were  acetylated 
by  mmGCNS,  we  performed  assays  with  synthetic  peptides 
corresponding  to  the  amino-terminal  tail  of  this  histone.  As 
expected,  we  found  that  the  full-length  GCNS  protein  efii- 
ciently  acetylated  peptides  corresponding  to  the  first  20  amino 
acids  of  histone  H3  (Fig.  6 A  and  B).  This  domain  alone,  then, 
is  sufficient  for  binding  to  the  enzyme  and  subsequent  catalysis. 
However,  mmGCNS  could  not  acetylate  a  peptide  that  con¬ 
tained  acetyl-lysine  moieties  at  positions  9  and  14  (Fig.  6B), 
suggesting  that  one  or  both  of  these  lysines  may  be  a  target  site 
for  mmGCNS.  In  contrast,  mmGCNS  readily  acetylated  a  pep¬ 
tide  containing  acetyl-lysine  moieties  at  positions  9  and  18  (Fig. 
6A).  Taken  together,  these  data  suggest  that  K14  is  the  pre¬ 
ferred  acetylation  site  in  H3  for  mmGCNS.  Similar  assays 
performed  with  H4  peptides  indicate  that  K8  is  the  preferred 
site  of  acetylation  in  H4  (data  not  shown).  These  results  are 
consistent  with  the  site  specificity  determined  for  recombinant 
yeast  GcnSp,  which  was  confirmed  by  protein  sequencing  of 
acetylated  histones  (23).  Importantly,  these  results  indicate 
that  the  extended  amino-terminal  domain  of  mmGCNS  does 
not  change  the  histone  or  lysine  residue  specificity  of  the  en¬ 
zyme. 

The  specificity  of  mmP/CAF  was  also  tested  with  the  peptide 
substrates.  In  all  respects,  mmP/CAF  exhibited  a  substrate 
specificity  identical  to  that  of  mmGCNS  (Fig.  6C  and  D). 

One  striking  difference  between  the  previously  reported, 
shorter  form  of  recombinant  hsGCNS  (or  yeast  GcnSp)  and 
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FIG.  7.  Acetylation  of  nucleosomal  histones  by  mmGCN5  and  mmP/CAF. 
Acetyltransferase  assays  were  performed  with  HeLa  cell  mononucleosomes  or 
free  histones  as  indicated,  and  an  aliquot  of  each  assay  mixture  was  resolved  on 
an  SDS-22%  polyacrylamide  gel.  Coomassie  blue-stained  gels  and  correspond¬ 
ing  autoradiographs  are  shown.  In  both  assays,  histones  H3  and  H4  were  acety- 
lated  by  the  recombinant  full-length  mmGCN5. 


recombinant  hsP/CAF  was  the  ability  of  P/CAF  to  acetylate 
nucleosomal  substrates  (23,  41),  Given  the  homology  between 
the  amino-terminal  portions  of  P/CAF  and  mmGCNS,  we 
asked  whether  the  full-length  recombinant  mmGCNS  could 


also  acetylate  histones  within  a  nucleosome.  We  found  that 
mmGCNS,  like  hsP/CAF,  can  acetylate  nucleosomal  H3  and, 
to  a  lesser  degree,  H4  (Fig,  7),  In  agreement  with  previously 
reported  results  (23,  41),  we  also  found  that  the  short  form  of 
mmGCNS  or  yeast  GcnSp  was  unable  to  acetylate  nucleosomes 
(data  not  shown).  These  results  suggest  that  one  function  of 
the  amino-terminal  domains  of  mammalian  GCN5  and  P/CAF 
may  be  to  facilitate  the  recognition  of  chromatin  templates. 

mmGCNS  and  mmP/CAF  both  interact  with  CBP  and  p300. 
hsP/CAF  interacts  with  CBP  and  p300  (41).  Given  the  similar¬ 
ity  between  mmGCNS  and  mmP/CAF,  we  examined  the  abil¬ 
ities  of  both  of  these  proteins  to  bind  to  CBP  or  p300  in  vitro 
(Fig.  8). 

Whole-cell  lysates  from  bacteria  expressing  fragments  of 
CBP  fused  to  GST  (fusion  constructs  were  kindly  provided  by 
Y.  Nakatani,  National  Institutes  of  Health)  (41)  were  mixed 
with  lysates  from  cells  expressing  the  amino-terminal  domain 
of  mmP/CAF,  the  amino-terminal  domain  of  mmGCNS,  or  the 
C-terminal  domain  of  mmGCNS.  The  CBP  fragments  (A  to  F) 
spanned  the  ADA2  homology  domain  and  extended  into  the 
transcriptional  activation  domain  (41).  A  fragment  of  p300 
(B')  homologous  to  the  B  fragment  of  CBP  was  also  tested. 
GST  fusion  proteins  were  purified  together  with  any  interact¬ 
ing  proteins  by  using  glutathione-Sepharose,  and  the  interact¬ 
ing  proteins  were  identified  by  Western  blotting  with  an  anti¬ 
serum  specific  for  the  six-histidine  tag  present  in  the 
recombinant  mmP/CAF  or  mmGCNS  protein. 
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FIG,  8.  mmGCNS  and  mmP/CAF  both  interact  with  CBP  and  p300  in  vitro.  (A)  Fragments  of  CBP  fused  to  GST  that  were  used  for  the  interaction  assays  in  panel 
C.  These  fragments  span  the  region  of  homology  to  ADA2  and  extend  into  the  transactivation  domain  of  CBP  (41).  (B)  Recombinant,  His-tagged  proteins  used  in  the 
interaction  assays  of  panel  C  were  resolved  by  SDS-polyacrylamide  gel  electrophoresis  and  probed  with  an  antibody  specific  for  the  His  tag.  The  amounts  shown 
represent  0.25  or  1%  of  the  protein  used  in  the  assay,  as  indicated.  (C)  GST-CBP  or  -p300  fusion  proteins  were  mixed  with  crude  bacterial  lysates  containing  P/CAF, 
GCN5,  or  HIRA  protein.  Interacting  proteins  were  recovered  by  using  glutathione-Sepharose  and  detected  on  Western  blots  with  the  anti-His  tag  antibody.  The  p300 
B'  fragment  is  homologous  to  the  CBP  B  fragment,  and  the  CBP  AB  fragment  is  missing  residues  1801  to  1851  (41). 
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The  amino-terminal  domain  of  mmP/CAF  selectively  bound 
to  fragments  A  and  B  of  CBP  and  the  corresponding  B'  frag¬ 
ment  of  p300.  In  some  experiments,  we  also  observed  binding 
to  the  D  fragment,  but  we  never  observed  binding  to  fragment 
C,  E,  or  F.  A  deletion  within  the  B  fragment  (AB)  of  CBP  that 
removed  residues  1801  to  1851  eliminated  binding.  This  pat¬ 
tern  of  binding  to  the  CBP/p300  fragments  is  extremely  similar 
to  that  previously  reported  for  hsP/CAF  (41),  as  expected, 

A  recombinant  form  of  hsGCN5,  which  lacked  the  amino- 
terminal  domain  reported  here  for  mmGCN5,  failed  to  bind 
CBP  or  p300  in  previous  experiments  by  Yang  et  al.  (41).  This 
form  of  hsGCN5  corresponds  to  the  C-terminal  region  of 
mmGCN5.  We  therefore  compared  binding  of  the  amino-ter¬ 
minal  and  the  C-terminal  halves  of  mmGCNS  to  the  GST-CBP 
and  -p300  fragments.  Surprisingly,  we  found  that  both  of  these 
mmGCNS  domains  bound  to  CBP  fragments  A  to  D,  with  little 
or  no  binding  to  fragment  E,  fragment  F,  or  the  AB  fragment. 
Both  the  amino-terminal  and  C-terminal  regions  of  mmGCNS 
also  bound  to  the  p300  B'  fragment.  The  amount  of  the  GST 
fusion  proteins  recovered  from  the  GST  columns  that  did  not 
exhibit  binding  to  the  GCN5  fragments  was  greater  than  or 
equal  to  that  of  the  GST  fusions  that  did  exhibit  binding  (data 
not  shown),  so  the  absence  of  binding  was  not  due  to  reduced 
amounts  of  the  E,  F,  or  AB  fragment.  In  addition,  the  selective 
binding  of  the  mmGCNS  peptides  to  CBP  fragments  A  to  D 
indicates  that  these  interactions  are  not  nonspecifically  medi¬ 
ated  by  the  GST  moiety,  since  this  moiety  is  also  present  in 
fragments  E,  F,  and  AB.  The  specificity  of  the  interactions  was 
further  tested  by  using  an  unrelated  protein,  HIRA,  which 
failed  to  bind  to  any  of  the  GST-CBP  or  -p300  fragments. 
Thus,  CBP  fragments  A  to  D  do  not  exhibit  general,  nonspe¬ 
cific  binding  to  random  proteins.  We  conclude  that  mmGCN5 
contains  two  distinct  CBP/p300  interaction  domains  and  that 
these  domains  interact  with  a  broader  region  in  CBP  than  does 
P/CAF.  Importantly,  our  finding  that  mmGCN5  and  mmP/ 
CAP  can  both  interact  with  CBP/p300  indicates  that  these 
proteins  are  very  similar  in  function  as  well  as  in  structure. 

DISCUSSION 

The  recent  identification  of  nuclear  histone  acetyltrans- 
ferases  has  directly  linked  chromatin  modification  with  tran¬ 
scriptional  regulation  (1,  5,  26,  29).  We  report  here  the  cloning 
of  mmGCNS  and  mmP/CAF  sequences.  We  find  that 
mmGCNS  differs  from  yeast  GCNS  and  the  previously  re¬ 
ported  hsGCNS  sequences  in  that  it  encodes  a  large  N-termi- 
nal  domain  similar  to  that  found  in  P/CAF.  Our  data  indicate 
that  hsGCNS  contains  this  extra  domain  as  well.  While  this 
domain  does  not  appear  to  affect  the  histone  specificity  of  the 
acetyltransferase,  it  does  afford  the  enzyme  the  ability  to  mod¬ 
ify  nucleosomal  substrates  in  vitro. 

In  vivo,  both  the  yeast  and  mammalian  enzymes  must  inter¬ 
act  with  and  modify  nucleosomal  histones.  In  yeast,  this  is 
accomplished  by  association  of  Gcn5p  into  high-molecular- 
weight  protein  complexes  that  can  modify  nucleosomes  and 
that  recognize  additional  histones  (14).  At  least  some  of  the 
Gcn5p-associated  proteins  are  conserved  in  higher  eukaryotes, 
and  Gcn5-Ada  complexes  have  been  identified  in  human  cells 
(7),  further  indicating  that  these  enzymes  serve  similar  func¬ 
tions  across  species.  We  scanned  the  yeast  genome  database  to 
determine  whether  a  protein  homologous  to  the  amino-termi¬ 
nal  domains  of  mmGCNS  or  mmP/CAF  might  exist  that  could 
be  a  component  of  the  GcnSp-containing  complexes.  However, 
we  found  no  such  homologs. 

Interestingly,  a  single  GCA/5-related  gene  has  been  identi¬ 
fied  in  Drosophila.  This  gene  exhibits  high  similarity  to  mam¬ 


malian  PICAF  (34a)  and  encodes  the  extended  N-terminal 
domain.  Although  this  domain  is  apparently  not  needed  in 
yeast,  its  functions  are  not  restricted  to  mammals. 

Our  work  indicates  that  multiple  differentially  spliced  forms 
of  GCNS  transcripts  coexist  in  both  mouse  and  human  cells, 
which  may  generate  different  isoforms  of  GCN5  proteins.  Of 
course,  we  cannot  rule  out  the  possibility  that  the  less  abundant 
products  represent  incompletely  spliced  RNAs,  but  it  is  inter¬ 
esting  that  intron  6  and  the  stop  codons  therein  are  conserved 
between  human  and  mouse.  Since  we  detected  transcripts  con¬ 
taining  intron  6  in  both  mouse  and  human  tissues,  we  are 
intrigued  by  the  possibility  that  shortened  GCNS  proteins, 
containing  either  the  N-terminal  domain  alone  or  the  C-ter¬ 
minal  domain  alone,  may  provide  an  additional  level  of  regu¬ 
lation  of  GCNS  functions.  For  example,  we  detected  the  full- 
length  GCNS  protein  in  mouse  embryo  extracts  and  some 
human  cells,  but  we  detected  a  shorter,  less  abundant  protein 
in  HeLa  cells  in  addition  to  the  full-length  protein.  It  will  be 
especially  interesting  to  determine  whether  various  forms  of 
GCNS  proteins  are  differentially  regulated  in  different  cell 
types  or  at  different  developmental  stages. 

The  long  form  of  mmGCNS  is  very  similar  to  mmP/CAF  in 
structure,  in  acetyltransferase  activity  and  substrate  specificity, 
and  in  interactions  with  CBP/p300.  Additional  experiments  are 
needed  to  determine  whether  these  two  proteins  are  function¬ 
ally  redundant  in  vivo.  Even  if  GCNS  and  P/CAF  perform  the 
same  functions,  they  might  be  utilized  at  different  developmen¬ 
tal  stages  or  in  different  cell  types  or  tissues.  The  similarity 
between  these  proteins  is  somewhat  reminiscent  of  that  be¬ 
tween  CBP  and  p300.  These  two  proteins  also  appear  to  be 
functionally  equivalent  in  vitro,  but  mutations  in  p300  and  CBP 
cause  different  phenotypes  (27, 30),  indicating  that  the  proteins 
are  not  functionally  redundant  in  vivo.  It  will  be  interesting  to 
determine  whether  the  same  is  true  for  mmGCNS  and  mmP/ 
CAF. 

Several  histone  acetyltransfe rases,  including  p300/CBP  and 
P/CAF,  have  been  implicated  in  growth  control  and  tumori- 
genesis  (30,  32,  41).  p300/CBP  physically  interacts  with  the 
tumor  suppressor  pS3  and  potentiates  sequence-specific  DNA 
binding  and  transactivation  by  pS3  through  acetylation  of  its 
C-terminal  domain  (IS).  Moreover,  mutations  in  p300  have 
been  found  in  certain  colorectal  and  gastric  cancers  (27).  CBP 
mutations  are  also  involved  in  the  etiology  of  certain  acute 
myeloid  leukemias  and  Rubinstein-Taybi  syndrome  (30),  a  de¬ 
velopmental  disorder  with  a  high  incidence  of  neoplasms.  In 
addition,  P/CAF  counteracts  the  transforming  activity  elicited 
by  oncoprotein  ElA,  and  overexpression  of  P/CAF  has  been 
shown  to  inhibit  cell  cycle  progression  (41),  Therefore,  histone 
acetyltransferases  have  been  postulated  to  be  negative  regula¬ 
tors  of  cell  growth  and,  possibly,  tumor  suppressors.  We  (this 
study)  and  others  (9)  have  found  that  GCNS  cosegregates  with 
the  tumor  suppressor  BRCAl  gene  (1-centimorgan  interval)  in 
a  highly  syntenic  region  in  mouse  (chromosome  11)  and  hu¬ 
man  (chromosome  17).  Interestingly,  loss  of  heterozygosity  on 
human  chromosome  17  is  a  frequent  genetic  alteration  in  spo¬ 
radic  breast  and  ovarian  cancers,  where  mutations  in  BRCAl 
and  BRCA2  are  rarely  found  (28,  35).  Indeed,  a  novel  tumor 
suppressor  gene  involved  in  these  cancers  has  been  postulated 
to  be  located  adjacent  to  the  BRCAl  locus  (28,  35).  GCNS  may 
provide  an  attractive  candidate  for  this  novel  tumor  suppres¬ 
sor.  The  isolation  and  characterization  of  mmGCNS  and  mmPI 
CAF  reported  here  should  facilitate  further  study  of  the  roles 
of  these  genes  and  of  histone  acetylation  in  normal  mammalian 
development,  as  well  as  in  abnormal  events  leading  to  tumor- 
igenesis. 
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Histone  acetyltransferases  (HATs)  are  important  in  transcriptional  regulation  but  little  is 
known  about  the  role  of  these  enzymes  in  developmental  processes.  Two  mouse  HATs,  GCN5  and 
PCAF  (pSOO/CBP  Associated  Factor)  are  highly  related  in  sequence  and  both  interact  with 

p300/CBP^.  PCAF  is  thought  to  participate  in  many  cellular  processes  regulated  by  p300/CBP^'*. 
Here  we  show  that  PCAF  is  dispensable  in  mice,  and  thus  is  not  required  for  p300/CBP  functions. 
In  contrast,  GCN5  null  embryos  are  severely  growth  retarded  by  8.5  dpc  and  fail  to  form  dorsal 
mesoderm  lineages,  including  chordamesoderm  and  paraxial  mesoderm.  However,  differentiation 
of  extraembryonic  mesoderm  and  cardiac  mesoderm  appears  unaffected  in  GCN5  mutants. 
Strikingly,  loss  of  the  dorsal  mesoderm  lineages  appears  due  to  a  high  incidence  of  apoptosis  in  the 
GCN5  mutants  that  begins  before  the  onset  of  morphological  abnormality.  Moreover,  embryos  null 
for  both  GCN5  and  PCAF  show  even  more  severe  defects,  indicating  that  these  HATs  haye 
overlapping  functions  during  embryogenesis.  Our  studies  are  the  first  to  demonstrate  that 
acetyltransferases  are  important  for  cell  survival  and  mesoderm  formation  during  mammalian 
development. 

We  reported  previously  that  mouse  GCN5  and  PCAF  share  similar  sequences  and  enzymatic 
activities*.  Both  are  expressed  ubiquitously,  but  in  complementary  amounts,  in  different  adult  mouse 
tissues*.  Here  we  examined  the  expression  of  these  genes  during  development.  GCN5  expression  is  high 
at  7.5  days  post  coitum  (dpc)  and  persists  until  16.5  dpc  (Fig.  la).  In  contrast,  PCAF  expression  is  barely 
detectable  during  embryogenesis  (Fig.  lb).  Whole-mount  in  situ  analysis  shows  that  GCN5  is  expressed 
uniformly  throughout  the  embryo  from  7.5  -  9.0  dpc,  except  in  the  distal  allantois  and  the  developing 
heart  (Fig.  Ic).  GCN5  expression  is  also  absent  in  the  yolk  sac  in  8.5-9.0  dpc  embryos  (data  not  shown). 

To  decipher  the  roles  of  these  enzymes  in  mammalian  development,  we  generated  mice  lacking 
PCAF  or  GCN5  (Figs.  2  and  3).  Despite  the  multiple  functions  implicated  for  PCAF  by  tissue  culture 
experiments^'*,  mice  homozygous  null  for  PCAF  are  viable  and  fertile  with  no  obvious  abnormal 
phenotype.  In  addition,  neither  GCN5  mRNA  (Fig.  2c)  or  protein  levels  (data  not  shown)  were 
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upregulated  in  the  tissues  of  PCAF  null  mice,  although  a  small  (2  fold)  increase  may  not  be  apparent  in 
this  analysis. 

Mice  heterozygous  for  the  GCN5  null  allele  were  obtained  at  the  expected  frequency  and 
appeared  normal,  but  mice  homozygous  for  the  GCN5  null  mutation  died  in  utero.  around  10.5  dpc. 
Genotyping  of  7.5-10.5  dpc  embryos  obtained  from  heterozygous  matings  showed  Mendelian 
segregation  of  the  GCN5  mutant  allele  (data  not  shown).  Lack  of  GCN5  expression  in  the  mutant 
embryos  was  confirmed  by  whole-mount  in  situ  hybridization  (Fig.  3c),  and  GCN5  mutant  embryos 
originating  from  two  independently  targeted  ES  cell  lines  exhibited  identical  phenotypes. 

At  7.5  dpc,  GCN5  null  mutants  are  morphologically  indistinguishable  from  their  wildlype  or 
heterozygous  littermates  (Fig.  3d).  However  at  8.5  dpc,  GCN5  null  embryos  can  be  identified  by  their 
smaller  size  and  abnormal  morphology.  The  ratio  of  extraembryonic  to  embryonic  tissue  in  these 
embryos  is  aberrant,  suggesting  continued  proliferation  of  the  extraembryonic  tissues  and  arrest  of  the 
embryonic  tissues.  The  amnion  and  allantois  of  the  mutant  embryos  also  appear  morphologically  normal. 
The  GCN5  mutants  have  an  ectopic  structure  in  the  exocoelomic  cavity  that  projects  from  the  anterior 
end  of  the  embryo.  This  ectopic  structure  continues  to  grow,  eventually  reaching  the  chorion  and  the 
ectoplacental  cone  (Fig.  3d).  The  identity  or  significance  of  this  structure  is  unclear.  Histologically  it 
resembles  ectodermal  cells  (Fig.  4g,  4h,  4i),  but  does  not  express  any  of  the  neural  markers  (Otx2, 
engrailed  1  and  Hes5  )(data  not  shown),  or  the  mesodermal  lineage  markers  examined  (see  below). 
Strikingly,  the  embryos  never  develop  somites  and  rarely  have  a  recognizable  neural  tube.  Mutants 
survive  to  10.5  dpc  with  apparent  heartbeats  but  die  shortly  thereafter.  Histological  analysis  of  the  GCN5 
mutants  at  8.5  dpc  indicates  that  extraembryonic  tissues  such  as  the  chorion,  amnion  and  the  allantois  are 
normal  (Fig.  4g,  4i  and  4j).  However,  the  embryo  proper  is  retarded  (Fig.  4),  resembling  that  of  a  wild 
type  embryo  at  7.5  dpc.  Embryonic  mesoderm  and  neural  epithelium  are  present  in  the  GCN5  mutants 
(Fig.  4k,  41). 
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Since  the  GCN5  null  embryos  lack  specific  mesodermal  derivatives,  we  examined  the  expression 

of  several  mesodermal  lineage  markers  (Fig.  5).  Both  Brachyury^  and  Fgf8*®,  two  early  mesodermal 
markers  expressed  in  the  primitive  streak,  are  detected  in  the  mutant  embryos  at  8.25  dpc  (Fig.  5a,  5b),  in 
a  pattern  similar  to,  although  weaker  in  the  case  of  Brachyury,  than  that  of  wild  type  embryos.  These 
results  indicate  that  early  primitive  streak  mesoderm  forms  in  the  mutant  embryos,  consistent  with  the 
histological  analysis  above.  However,  when  we  looked  for  the  presence  of  mesodermal  derivatives  of  the 
primitive  streak,  we  identified  deficits  in  specific  lineages.  Expression  of  paraxis^  *  and  Dill  (Delta-like 
1)’^,  which  are  normally  expressed  in  the  presomitic  mesoderm,  were  abolished  in  8.5  dpc  mutants 
(Fig.5c,  5e),  indicating  that  formation  of  paraxial  mesoderm  is  disrupted  by  loss  of  GCN5.  Next,  we 

analyzed  the  expression  of  Shh*^  and  HNF-3p^'^>  markers  for  chordamesoderm.  In  the  majority  (see 
below)  of  GCN5  nulls,  no  expression  of  Shh  and  HNF-3P  was  observed  in  the  region  of  the  notochord 
(Fig.  5f,  5g),  suggesting  that  chordamesoderm  formation  is  also  perturbed  by  GCN5  loss.  HNF-3P 
expression  was  detected  in  the  distal  tip  arid  the  anterior  lateral  area  of  the  mutant  embryos,  which  likely 
represents  normal  expression  in  the  definitive  endoderm  and  node.  To  assess  whether  the  head 

mesenchyme  is  formed  properly,  we  examined  the  expression  of  twist^^’  Twist  expression  appeared 

normal  in  the  allantois  of  the  GCN5  mutants  but  was  barely  detectable  in  embryonic  tissues  (Fig.  5h), 
suggesting  that  head  mesenchyme  formation  is  also  compromised  in  the  absence  of  GCN5.  MEF2C,  an 
early  marker  for  cardiac  mesoderm^^,  was  expressed  in  the  GCN5  mutants  at  8.0  dpc  in  the  same  pattern 
as  in  the  wild  type  controls  (Fig.  5i).  This  result  is  consistent  with  our  observation  that  the  GCN5  mutant 
embryos  have  heartbeats  at  10.5  dpc  and  suggests  that  cardiac  mesoderm  formation  is  unaffected  in  these 
mutants. 

Taken  together,  the  above  results  indicate  that  while  early  mesoderm  is  formed  in  the  GCN5  null 
embryos,  GCN5  is  required  for  the  formation  or  maintenance  of  paraxial  mesoderm,  chordamesoderm 
and  head  mesenchyme.  Only  extraembryonic  and  cardiac  mesoderm  appear  relatively  normal. 
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The  lineage-specific  defects  in  mesoderm  formation  might  arise  from  a  lack  of  initial  mesoderrn 
specification  or  a  failure  to  maintain  the  specified  mesoderm.  Paraxis  is  expressed  as  early  as  7.5  dpc  in 
wild  type  embryos.  Therefore,  we  re-examined  paraxis  expression  at  this  stage  in  the  mutant  embryos,  to 
see  if  paraxis  might  be  expressed  transiently  before  morphological  defects  become  visible.  Expression  of 
paraxis  was  observed  in  GCN5  mutant  embryos  between  7.5  to  8.0  dpc  (Fig.5d).  This  observation 
indicates  that  paraxial  mesoderm  is  specified  but  does  not  survive  in  the  absence  of  GCN5.  To  see  if  this 
loss  is  due  to  a  decrease  in  cell  proliferation,  we  compared  the  expression  of  Proliferating  Cell  Nuclear 

Antigen  (PCNA),  which  is  enriched  during  S  phase  in  wild  type  and  GCN5  null  embryos. 
Expression  of  PCNA  in  GCN5  mutants  at  8.5  dpc  was  equal  to  or  higher  than  that  observed  in  wild  type 
embryos  at  7.5  or  8.5  dpc,  indicating  that  cell  proliferation  is  not  adversely  affected  by  GCN5  loss  (data 
not  shown).  We  then  asked  whether  the  loss  of  paraxial  mesoderm  might  reflect  increased  cell  death  by 
performing  TUNEL  assays  for  apoptosis.  At  7.5  dpc,  when  GCN5  mutants  are  morphologically 
indistinguishable  from  their  wildtype  littermates,  a  strikingly  high  degree  of  apoptosis  is  apparent  in 
sections  from  the  mutants  (Fig.  6c,  6d).  In  contrast,  apoptotic  cells  are  rarely  found  in  7.5  dpc  wildtype 
embryos  (6a,  6b).  At  8.5  dpc,  more  than  25%  of  cells  (Fig.  6f)  are  undergoing  apoptosis  in  the  mutants, 
whereas  only  about  1  %  of  cells,  mostly  clustered  in  the  midline  of  the  neural  tube  of  the  wild  type 
embryos  are  apoptotic  (Fig.6e).  Increased  cell  death  in  the  mutants  is  restricted  to  the  embryonic 
ectoderm  and  mesoderm  (Fig.  6c,  6d  and  6g).  These  results  suggest  that  increased  cell  death  is 
responsible  for  the  disappearance  of  dorsal  mesoderm  lineages  in  GCN5  mutants. 

Lastly,  we  generated  mice  harboring  compound  mutations  for  GCN5  and  PCAF.  Mice 
heterozygous  for  GCN5  and  null  for  PCAF  are  normal.  However,  mice  doubly  null  for  both  HATs  die 
around  7.5  dpc,  several  days  earlier  than  the  GCN5  mutants.  Double  mutants  are  severely  retarded,  and 
some  are  filled  with  blood,  indicating  that  they  are  moribund  (data  not  shown).  These  embryos  are 
observed  at  Mendelian  ratios  at  7.5  dpc,  but  were  no  longer  found  after  8.5  dpc.  Thus,  the  combined  loss 
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of  PCAF  and  GCN5  leads  to  even  more  severe  developmental  defects  than  observed  for  loss  of  GCN5 
alone,  indicating  that  GCN5  and  PCAF  share  overlapping  functions  during  early  embryogenesis. 

Our  studies  yield  several  important  conclusions.  First,  unexpectedly  in  light  of  results  from 
tissue  culture  experiments,  PCAF  is  dispensable  for  mouse  development  and  is  not  necessary  for  normal 

functions  in  the  adult  mouse.  Although  PCAF  physically  interacts  with  CBP  and  p30p’’  the  lack  of  a 
developmental  phenotype  for  the  PCAF  mutant  mice  demonstrates  that  PCAF  is  not  absolutely  required 

for  p300  or  CBP  functions  during  embryogenesis^^.  Second,  our  studies  show  that  GCN5  is  essential  for 
embryogenesis  and  that  absence  of  GCN5  results  in  loss  of  paraxial  mesoderm,  chordamesoderm,  and 
head  mesenchyme.  In  contrast,  extraembryonic  mesoderm  and  the  cardiac  mesoderm  are  relatively 
unaffected,  suggesting  that  GCN5  is  not  required  for  their  survival,  which  is  consistent  \yith  lack  of 

GCN5  expression  in  these  lineages.  Although  GCN5  physically  interacts  with  p300/CBP',  the 
phenotypes  of  mice  lacking  p300  or  GCN5  are  substantially  different^",  suggesting  that  GCN5  functions 
are  unique  and  are  not  merely  supplementary  to  that  of  p300/CBP.  Thus,  loss  of  different  HAT  activities 
during  development  leads  to  different  defects  in  embryogenesis.  Lastly,  our  studies  show  that  loss  of 
GCN5  leads  to  a  dramatic  increase  in  apoptosis  in  the  developing  embryonic  ectoderm  and  mesoderm. 
Since  we  observe  increased  apoptosis  before  we  observe  morphological  defects,  increased  apoptosis  is 
likely  to  be  a  primary  result  of  GCN5  loss,  and  the  defective  formation  of  mesodermal  lineages  likely 
reflects  increased  cell  death.  Occasionally  (about  10%  of  the  time),  we  observed  GCN5  mutants  that 
were  slightly  larger  with  a  more  apparent  head  fold  and  body  axis.  Notochord  staining  of  Shh  and  tiny 
patches  of  paraxis  expression  are  detected  in  a  subset  of  these  less  severe  mutants.  It  |s  possible  that  the 
heterogeneity  observed  in  the  GCN5  mutants  is  the  result  of  slight  variations  in  the  timing  of  onset  or  the 
extent  of  apoptosis,  which  in  turn  could  be  influenced  by  the  varied  C57BL6/J  and  129  backgrounds  of 
the  mutant  embryos.  Several  hypotheses  might  explain  the  requirement  of  GCN5  for  cell  survival. 

GCN5  might  be  required  for  the  expression  of  "anti-apoptotic"  genes  during  early  embryogenesis. 


6 


Alternatively,  histone  acetylation  by  GCN5  might  be  linked  to  checkpoint  pathways  that  monitor  genome 
integrity.  In  yeast,  mutation  of  histone  acetylation  sites,  or  loss  of  specific  acetyltransferases  or 

deacetylases,  leads  to  accumulation  of  cells  in  G2/M^^’  Disruption  of  the  RAD9  DNA  damage 
checkpoint  reverses  some  of  these  cell  cycle  defects  ’  ,  suggesting  that  a  balance  of  histone 

acetylation/deacetylation  may  be  necessary  to  maintain  genome  integrity  or  to  allow  proper  chromosome 
condensation  for  completion  of  mitosis.  Finally,  acetylation  of  non-histone  proteins  might  be  crucial  to 
regulating  apoptosis.  For  example,  both  PCAF  and  p300  acetylate  p53,  which  enhances  its  DNA-binding 

n  c 

activity.  Acetylation  of  p53  also  increases  in  response  to  DNA  damage  ’  .  One  scenario  might  be  that 
p53  is  acetylated  by  GCN5  early  in  development  and  that  loss  of  this  acetylation  in  the  GCN5  mutants 
triggers  apoptosis.  Such  possibilities  set  the  stage  for  future  studies. 


Methods 

PCAF  targeting.  We  isolated  a  portion  of  the  PCAF  gene  by  screening  a  mouse  genomic  library 
(Lambda  FIXII,  Stratagene).  One  clone  (19.5  Kb)  consists  of  two  exons,  which  are  127  bp  and  146  bp  in 
length  and  are  located  249  and  376  bp  downstream  of  the  translation  start  site  in  the  PCAF  coding 
sequence.  The  latter  exon  was  targeted.  A  SV40  poly-adenylation  sequence  was  inserted  into  the 
targeting  vector  between  the  5’  and  3’  homology  arms.  Upon  homologous  recombination,  the  targeted 
exon  was  replaced  by  the  poly-adenylation  sequence  and  the  PGK  Neo  cassette,  which  would 
prematurely  terminate  the  PCAF  transcript.  The  length  of  the  targeted  exon  is  not  a  multiple  of  three  bp, 
so  in  the  event  that  the  targeted  exon  is  skipped  by  splicing,  the  resulted  cDNA  would  be  out  of  frame  for 
the  PCAF  protein.  For  genotyping,  genomic  DNA  was  digested  with  PstI  or  Xbal,  and  probed  with  a  1.3 
Kb  Pstl-Xbal  fragment  or  a  0.65  Kb  BamHI  fragment  respectively  (Fig.  2). 

GCN5  targeting.  GCN5  genomic  sequence  that  covers  the  entire  GCN5  coding  region  was  isolated 
previously* .  A  1 .5  Kb  genomic  fragment  upstream  of  the  translation  start  site  was  used  as  5'  homology 
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arm.  A  5.5  Kb  fragment  was  excised  by  Xbal  and  Notl  to  generate  the  3’  homology  arm.  In  addition,  a 
LacZ  gene  was  inserted  into  the  GCN5  locus,  where  its  transcription  was  to  be  driven  by  the  GCN5 
regulatory  sequences  (Fig.  3).  Homologous  recombination  replaced  the  GCN5  coding  region  with  the 
LacZ  gene  and  the  PGK-Neo  cassette.  For  genotyping,  genomic  DNA  was  digested  with  EcoRV  and 
probed  with  a  0.6  Kb  Apal  fragment  5'  to  the  5'  homology  arm.  Similarly,  genomic  DNA  was  digested 
with  EcoRV  and  probed  with  a  0.5  Kb  PCR  generated  fragment  3'  to  the  3'  homology  arm.  PCR  primers 
used  to  generate  the  3'  probe  are:  5’-AGTAGGTGGTATGGCTTCTCAG;  3’- 
CTAGAAGGCTCAGGATACCATC. 

Creation  of  PCAF  or  GCN5  null  mice.  GCN5  and  PCAF  targeting  vectors  were  linearized  and 
electroporated  separately  into  129/SvEv  derived  ES  cells  ABl,  originally  generated  by  Dr.  A  Bradley 
(Baylor  College  of  Medicine).  G418  and  FIAU  double  resistant  cells  were  genotyped  by  Southern  blot 
analyses  using  GCN5  or  PCAF  probes  described  above.  Correctly  targeted  ES  cells  were  injected  into 
3.5  dpc  C57BL/6J  (Jackson  Lab)  blastocysts.  Male  chimeras  were  mated  with  wildtype  C57BL/6J 
females.  Germline  transmission  of  the  mutant  allele  was  determined  by  Southern  blot  analysis  of  tail 
genomic  DNA  using  the  probes  described  above.  Three  out  of  four  independent  PCAF  targeted  ES  cell 
lines  and  two  out  of  three  independent  GCN5  ES  cell  lines  gave  rise  to  germline  transmission. 
Heterozygous  mice  were  intercrossed  to  generate  homozygous  null  mice.  Embryos  younger  than  8.5  dpc 
were  genotyped  by  PCR.  Primers  for  GCN5:  5’-AACCTGACCCTGGAGGATGCCAAG>  3 - 
GTGACGGCACAGAAGACGATTTCC.  Primers  for  PCAF:  5'-AAGCTGCTCACGTTTCTCACTTGG; 
3'-GGGTAAGCTCTCACCTTGAATAGG.  Primers  forNeo:  5'-TCCTGCCGAGAAAGTATCCAT;  3'- 
AGGAAAGGACAGTGGGAGTGG.PrimersforLacZ:5'- 
GCATCGAGCTGGGTAATAAGGGTTGGCAAT;3'- 
GACACCAGACCAACTGGTAATGGTAGCGAC. 

Generation  of  PCAF  and  GCN5  double  mutants.  GCN5  heterozygous  mice  were  crossed  with  PCAF 
heterozygotes  to  create  mice  doubly  heterozygous  for  GCN5  and  PCAF.  Double  heterozygotes  were 
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mated  with  PCAF  null  mice  to  produce  mice  that  were  heterozygous  for  GCN5  and  null  for  PCAF.  Mice 
heterozygous  for  GCN5  and  null  for  PCAF  were  normal  and  intercrossed  to  generate  mice  that  were 
doubly  null  for  both  genes. 

RNA  analysis.  Total  RNA  from  whole  embryos  or  adult  mouse  tissues  was  isolated  and  electrophoresed 
on  1.1%  agarose  gels  containing  formaldehyde.  RNA  was  transferred  to  GeneScreen  Plus  membrane  , 

(NEN  Life  Science)  and  hybridized  with  GCN5  and  PCAF  cDNA  probes  as  previously  described  1. 
Histological  studies.  Mouse  embiyos  were  fixed  with  4%  paraformaldehyde  overnight  at  4°C,  and 
dehydrated  with  graded  alcohol.  Embryos  were  cleared  in  xylene,  incubated  in  1 : 1  xylene/paraffin  for  45 
min  at  65°C,  washed  3X  with  paraffin  30min  each  at  65°C,  and  embedded  in  paraffin.  Sections  were 
made  at  7  pm  and  stained  with  hematoxylin  and  eosin  according  to  standard  procedure. 

Whole-mount  in  situ  hybridization.  Whole  mount  in  site  hybridization  of  embryos  was  performed  as 
described  by  Hogan  et  al.  1994  Three  or  more  embryos  were  used  for  each  marker.  The  cDNAs  used 
to  generate  probes  used  have  been  described  previously^’  The  cDNAs  used  to  generate  probes 

for  paraxis*  ’  and  MEF2C’^  were  kindly  provided  by  Dr.  E.  Olson. 

TUNEL  assay.  TUNEL  assay  was  performed  on  paraffin  sections  using  the  ApopTag  Plus  Peroxidase  In 
situ  Apoptosis  Detection  Kit  purchased  from  Intergen  according  to  the  manufacturer’s  protocol. 
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Figure  1  Differential  expression  of  GCN5  and  PCAF  in  mouse  embryos,  a-b,  total  RNA  was 
isolated  from  mouse  embryos  at  the  stages  indicated.  Northern  blot  hybridization  was  carried  out 
using  either  GCN5  (a)  or  PCAF  (b)  specific  probes.  Ribosomal  S26  sequences  were  used  as  an 
internal  control,  c,  whole-mount  in  situ  hybridization  using  a  GCN5  specific  probe.  Arrows 
indicate  distal  allantois  (8.5  dpc  embryo)  and  the  heart  (9.0  dpc  embryo). 

Figure  2  Disruption  of  the  PCAF  gene,  a,  targeting  strategy  for  the  PCAF  gene.  A  5’  exon 
(shown  in  black)  in  the  PCAF  gene  was  deleted  by  homologous  recombination  and  replaced  with 
a  SV40  polyadenylation  cassette  and  the  phosphoglycerate  kinase  (PGK)-Neo  gene.  The  size  and 
position  of  the  targeted  exon  is  indicated.  Positions  of  the  5’  and  3 ’probes  for  Southern  blot 
analyses  are  shown.  Arrows  mark  the  orientation  of  the  thymidine  kinase  (TK),  PGK-Neo,  or 
PolyA  cassettes.  HAT:  acetyltransferase  catalytic  domain.  Bromo:  the  bromodomain.  P:  PstI;  X: 
Xbal.  b,  Genotyping  of  PCAF  targeted  mice  by  Southern  blot  hybridization.  Mouse  tail  genomic 
DNA  isolated  from  wildtype,  PCAF  heterozygous  and  homozygous  mutant  mice  were  digested 
with  Xbal  and  hybridized  with  3’  probe  shown  in  panel  a.  The  genotypes  of  the  mice  are 
indicated  on  top  of  the  panel.  The  sizes  of  the  wildtype  and  mutant  bands  are  indicated,  c,  Total 
RNA  was  isolated  from  tissues  of  wildtype  and  PCAF  null  mice.  Northern  blot  hybridization  was 
performed  using  a  PCAF  probe  that  is  3’  to  the  targeted  exon.  The  same  blot  was  subsequently  re¬ 
probed  with  a  GCN5  specific  probe  and  an  internal  control  S26  probe.  The  positions  of  PCAF, 
GCN5  and  S26  signals  are  indicated  by  arrows. 

Figure  3  GCN5  null  mice  die  during  embryogenesis.  a,  targeting  strategy  for  disruption  of  the 
mouse  GCN5  gene.  By  homologous  recombination,  the  entire  GCN5  coding  region  was  removed 
and  replaced  with  a  promoterless  lacZ  gene,  which  is  placed  under  the  control  of  the  GCN5 
promoter.  ATG  marks  the  translation  start  site  of  the  GCN5  coding  sequence.  Arrows  indicate  the 
orientation  of  the  lacZ  and  PGK-Neo  genes.  RV:  EcoRV.  b,  Genotyping  of  GCN5  targeted  mice 
by  Southern  blot  hybridization.  Genomic  DNA  was  isolated  from  wildtype  and  GCN5 
heterozygous  mutant  mice  as  indicated  and  digested  with  EcoRV.  Southern  blot  hybridization 
was  carried  out  using  the  5’  probe  shown  in  panel  a.  The  size  and  position  of  the  wildtype  and 
mutant  bands  are  indicated  by  arrows,  c,  whole-mount  in  situ  hybridization  of  a  wildtype  and 
GCN5  null  embryos  using  a  GCN5  riboprobe.  GCN5  is  widely  expressed  in  a  7.5  dpc  wildtype 
embryo,  but  not  in  a  GCN5  null  littermate.  d,  morphology  of  GCN5  null  embryos  at  7.5,  8.5  and 
9.0  dpc.  Genotypes  of  the  GCN5  alleles  are  indicated  below  the  embryos,  dpc:  days  post-coitum; 
es:  ectopic  structure;  al:  allantois. 


Figure  4  Histological  analysis  of  GCN5  mutant  embryos.  Paraffin  sections  were  prepared  from 
GCN5  mutant  embryos  and  control  littermates,  and  stained  with  haematoxylin-eosin.  a-f: 
transverse  sections  (proximal  to  distal)  of  a  8.5  dpc  wildtype  embryo,  g-1:  transverse  sections 
(proximal  to  distal)  of  a  8.5  dpc  GCN5  null  embryo,  al;  allantois;  am:  amnion;  hf:  head  fold;  he: 
heart;  gd:  gut  diverticulum;  so:  somite;  es:  ectopic  structure;  ch:  chorion;  hm:  head  mesoderm;  ne: 
neural  epithilium;  me:  mesoderm. 

Figure  5  Marker  gene  expression  in  GCN5  null  embryos.  Whole-mount  in  situ  hybridization  was 
performed  to  examine  the  expression  of  marker  genes.  In  all  panels  the  embryo  on  the  left  is  a 
wildtype,  the  one  on  the  right  is  a  GCN5  null.  Marker  genes  are  indicated  at  the  upper-left  corner 
of  each  panel.  Orientation  of  the  embryos  within  one  panel  is  consistent,  but  may  vary  in  different 
panels,  al:  allantois;  cm:  cardiac  mesoderm;  hm:  head  mesoderm;  ntc:  notochord;  ps:  primitive 
streak;  psm:  presomitic  mesoderm;  so:  somites,  a,  Brachyury  is  detected  at  the  posterior  end  of  a 
8.25  dpc  mutant  in  a  pattern  similar  to  that  of  the  wildtype  littermate,  but  at  reduced  levels,  b, 
FgfS  is  expressed  at  the  posterior  end  of  a  8.25  dpc  mutant  in  a  pattern  similar  to  that  of  the 
wildtype  littermate.  c,  Paraxis  is  expressed  in  the  presomitic  mesoderm  and  somites  of  a  8.5  dpc 
wildtype  embryo,  but  its  expression  is  absent  in  a  GCN5  mutant  littermate.  d,  Paraxis  is 
expressed  in  the  paraxial  mesoderm  of  a  7.75  dpc  GCN5  null  embryo,  but  its  expression  is 
reduced  compared  to  a  wildtype  littermate.  e.  Dll  1  is  expressed  in  the  presomitic  mesoderm  of  an 
8.5  dpc  wildtype  embryo,  but  not  in  a  GCN5  mutant  littermate.  f,  sonic  hedgehog  (shh)  is 
expressed  in  the  notochord  and  floor  plate  of  a  8.5  dpc  wildtype  embryo,  but  not  in  the  GCN5 
mutant  littermate.  g,  HNF3P  expression  is  detected  in  the  notochord  of  an  8.5  dpc  wildtype 
embryo.  HNF3P  expression  is  detected  at  the  distal  tip  and  ventral  lateral  region  of  the  mutant 
littermate,  but  not  in  the  notochord  region,  h.  Twist  is  expressed  in  the  head  mesenchyme, 
somites  and  allantois  of  an  8.5  dpc  wildtype  embryo.  Twist  is  expressed  in  the  allantois  of  the 
mutant  littermate,  but  only  very  weak  signal  is  detected  in  the  ectopic  structure  and  the  dorsal  half 
of  the  mutant  littermate.  i,  MEF2C  is  expressed  in  the  primitive  cardiac  mesoderm  of  an  8.0  dpc 
wildtype  and  its  mutant  littermate. 

Figure  6  Apoptosis  is  significantly  increased  in  GCN5  null  embryos.  Paraffin  sections  were 
prepared  from  7.5  dpc  wildtype  embryos  (a,  al  and  b),  7.5  dpc  GCN5  null  embryos  (c  and  d),  8.5 
dpc  wildtype  embryos  (e,  el  and  e2)  and  8.5  dpc  GCN5  mutants  (f,  fl,  g).  Extraembryonic  tissues 
of  7.5  dpc  embryos  from  heterozygous  matings  were  isolated  for  genotyping,  and  the  remaining 


embryonic  portions  of  the  embryos  were  processed  for  TUNEL  assays.  Apoptotic  cells  are 
brown.  Boxed  areas  in  panels  a,  e  and  f  are  magnified  in  panels  al,  el  e2,  and  fl.  ec:  ectoderm; 
me:  mesoderm;  en:  endoderm. 
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